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WO 00/58488 PCT/US00/08571 

DELIVERY OF FUNCTIONAL PROTEIN SEQUENCES 
BY TRANSLOCATING POLYPEPTIDES 

FIELD OF INVENTION 

5 

The present invention relates to methods for translocating polynucleotides and 
polypeptides between cells. More particularly, the present invention relates to use of 
translocating proteins to deliver a cell process-modifying molecule into the cell where 
the cell process-modifying molecule interacts specifically with a responsive target 
10 site. 

BACKGROUND OF THE INVENTION 

Translocating proteins are defined by their ability to cross biological 
membranes, such as cell membranes. A number of translocating proteins, have been 
described, including VP22 from Herpes Simplex Virus type 1 (G. Elliot and P. 
15 0*Hare, Cell fig, 223-233 (1997)), a fragment of the Antennapedia protein from 
Drosophila (Antp) (D. Derossi et al., Journal of Biological Chemistry 262, 10444- 
10450 (1994)), and Protein H from Streptococcus pyogenes (Axcrona et al., 
Manuscript in preparation (1999)). 

Antennapedia is a homeoprotein with a DNA binding domain composed of 
20 three alpha helices with a beta-turn separating helix 2 and 3. Experiments have 
demonstrated that a 16 amino acid peptide corresponding to the third helix, named 
Antp, can translocate across membranes and accumulate in the cytoplasm and 
nucleus (Derossi et al, supra). This peptide is internalized at a temperature as low as 
4°C, suggesting that endocytosis is not responsible for the internalization of the 
25 peptide. In addition, since translocation does not require classical endocytosis, Antp 
does not travel through the endosomal and lysosomal compartments. Therefore, Antp 
is resistant to proteolysis and has enhanced activity in most cellular compartments (D. 
Derossi et al., J Biol Chem 221:18188-18193, 1996). 



30 



Recent experiments showing that a reverse helix (i.e. the reverse primary 
sequence) and a helix composed of D-enantiomers can transverse plasma membranes 
at 4°C suggest that internalization of Antp involves the formation of inverted micelles 
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in the phospholipid bilayer, making entry into cells receptor-independent and energy 
free (H. Hall et al., Current Biol 6:580-587, 1996). 

The usefulness of Antp as a vector peptide has been proven successful by 
genetically fusing Antp to various peptides of interest (F. Perez et al., J Cell Sci 

5 102:717-722, 1992; F. Perez et al, Mol Endocrinol £: 1278-87, 1994; and A. 
Prochiantz, Curr Opinion Neurob 6:629-634, 1996) or by covalent linkage via 
cysteine residues (D. Derossi et al, supra). Internalization of peptides as large as 41 
amino acids and of charged phosphopeptides (B. Allinquant et al., J Cell Biol 
122:919-927, 1995) has been demonstrated in neuronal cells. In each case, the 

10 sequences fused to Antp retained their expected biological functions. Furthermore, 
Antp is the only translocating peptide that has been used to deliver oligonucleotides 
(up to 45 nucleotides in length) to cells in culture (CM. Troy et al., J Neuro 16 253- 
61, 1996; G. Elliot et al, J Virol 172:6448-6455, 1998). 

Protein H is a surface antigen of the human pathogen Streptococcus 
15 pyrogenes. Protein H is taken up by B- and T-lymphocytes and translocated to the 
nucleus. In contrast to other translocating proteins, which appear to have no effect on 
cellular function, protein H has a cytostatic effect thought to be the result of its 
association with the nuclear proteins SET and hnRNP A2/B1 (D. Derossi et al., 
supra). To date, the translocation of Protein H coupled to another molecule has not 
20 been demonstrated. 

The best studied of the translocating proteins is the Herpes Simplex Virus 
protein VP22, which has the unique ability to translocate between cultured 
mammalian cells. When cells are transfected with a plasmid encoding the VP22 
protein, the expressed protein accumulates in the cytoplasm of transfected cells and, 

25 by translocating across cell membranes, spreads to the surrounding non-transfected 
cells where it accumulates in the nuclei. This process can occur at 4°C and also 
appears to be energy-free and independent of endocytosis. When protein trafficking 
though the cell is blocked using Brefeldin A, export of VP22 can still occur. Studies 
of cytoskeletal elements during VP22 trafficking suggest that the actin cytoskeleton 

30 may be involved in export or import of VP22 (Elliot and O'Hare, supra). 
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Delivery of several functional VP22 fusion proteins has been described, 
including VP22-p53 (A. Phelan et al., Nature Biotechnology 1^:440-443, 1998)) and 
VP22-thymidine kinase (M.S. Dilber et al., Gene Therapy £: 12-21, 1999). At least 
twenty different mammalian cell types can take up a functional VP22-GFP fusion 
5 protein (Elliot and O'Hare, supra; Aints A., et al., J. Gene Med. 1:275-9, 1999; and 
Wybranietz W. A. et al, 1 Gene Med. 1:265-214, 1999), including mouse skeletal 
myoblasts that are refractory to conventional transfection techniques (Derer W. et al, 
1 Mol Med. 21: 609-6138, 1999). 

Transfection of cells with plasmid DNA has been an invaluable tool for the 
1 0 study of biological systems. A variety of transfection methods (e.g. lipids, calcium 
phosphate) exist in the marketplace; however, these methods rarely result in more 
than 50% of cells expressing a gene carried on a plasmid with which the cells are 
transfected. Since most cells do take up exogenous DNA, inefficient transfections do 
not appear to be due to inability of the DNA complex to enter the cell. The majority 
1 5 of DNA is internalized by endocytosis with very little of the internalized DNA ever 
reaching the cytoplasm or nucleus where expression takes place. Indeed, observations 
of directly injected lipid-DNA complexes suggest that movement from the endosomes 
to the cytoplasm and nucleus is the most important limitation to successful 
transfections (J. H. Richardson et al., Proc. Natl Acad. SW.J£:3 137-3 141, 1995). 
20 Consistent with this observation, peptides with membrane fusion activity, like the 
fusogenic peptide of hemagglutinin (J. Zabner et al., Journal of Biological Chemistry 
22Q.- 18997-9007, 1995), or a nuclear targeting sequence (M. Wilke et al., Gene 
Therapy 3, 1 133-1 142 (1996)) can increase transfection efficiencies in some cases. 

Thus, there is a need in the art for new and better methods for modulating 
25 expression in cells of target genes and for transfection reagents and methods of their 
use to overcome the major blocks to expression of transfected genes, i.e., degradation 
in the endosomes and the inability of DNA to enter the cell nucleus. 
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BRIEF DESCRIPTION OF THE INVENTION 

The present invention overcomes these problems in the art by providing 
method(s) for modulating a cellular process in a cell in culture by contacting such a 
cell with a cell process-modifying molecule attached to a translocating polypeptide 
5 under suitable conditions, whereby the cell process-modifying molecule is 

translocated into the cells in culture and interacts specifically therein with a target site 
responsive to the cell process-modifying molecule, thereby modulating a cellular 
process in the cell. 

In another embodiment, the present invention provides method(s) for 
10 transfecting a cell in culture with a target gene by contacting the cell under suitable 
conditions with a polynucleotide comprising the target gene attached to a 
translocating polypeptide, whereby the cell is transfected with the target gene. 

In still another embodiment, the present invention provides method(s) for 
modulating expression of a target gene product in a cell in culture that is transfected 
1 5 with the target gene under control of one or more regulatory elements by contacting 
the cell under suitable conditions with one or more regulatory agents attached to a 
translocating polypeptide, whereby the one or more regulatory agents are translocated 
into the cell and interact therein with the one or more regulatory elements, thereby 
modulating expression of the target gene product by the cell. 

20 In yet another embodiment, the present invention provides vector(s) 

comprising a polynucleotide encoding a cell process-modifying molecule attached to 
a translocating polypeptide. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a schematic drawing showing pFIN4/lacZ, which has an 
25 intervening sequence (inv) flanked by Flp recognition sites ifrt) separating the CMV 
promoter and (5-galactosidase gene (lacZ). Interaction of Flp recombinase with pFIN4 
results in the removal of the inv sequence and expression of P-galactosidase. 
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Figures 2A-D are schematic representations of the process by which a fusion 
protein composed of VP22, an anti-ATF-2 single chain antibody (sFv), and VP16 is 
delivered to the nucleus of a cell where it binds ATF-2 and activates transcription. 
Figure 2A shows the ATF-2/LexA DNA binding domain (DBD) fusion protein binds 

5 the LexA operator (Op) upstream of the minimal TK promoter and the luciferase 
reporter gene, but does not activate transcription. Figure 2B shows that the ATF-2 
sFv-VP16 fusion protein binds ATF-2 and activates transcription. Figure 2C shows 
that the CREB sFv-VP16 fusion protein does not bind ATF-2 and cannot activate 
transcription. Figure 2D shows that the fusion protein composed of VP22, the ATF-2 

10 sFv, and VP 1 6 is delivered to the nucleus, where it binds ATF-2 and activates 
transcription. 

Figures 3A-C show the attachment of a translocating protein (VP22) to an 
oligonucleotide (oligo) by generation of a Afunctional linker molecule. Figure 3 A 
shows the chemical structure of a phenylboronic acid (PBA)-adapted nucleotide 
1 5 (PB A-dUTP). Figure 3B shows the chemical structure of a salicylhydroxamic acid 
(SHA)-adapted amino acid (R = lysine). Figure 3C shows the reaction of the PBA- 
adapted nucleotide and the SHA-adapted amino acid to create a Afunctional linker 
molecule that attaches the oligonucleotide to VP22. 

Figure 4 is a schematic diagram illustrating a VP22-T7 RNA polymerase (T7 
20 pol) expression system. VP22-T7Pol accumulates in the nucleus upon exogenous 
addition to tissue culture cells. In the nucleus, the VP22-T7 pol fusion protein 
recognizes the T7 promoter and activates transcription of gene X. 

Figure 5 is a map of vector pVP22/Myc-His, which contains the T7 promoter 
(T7), VP22 open reading frame (VP22), a multiple cloning site, a myc epitope (myc), 
25 and a polyhistidine tag (6xHis). 

Figures 6A and B show the nucleotide sequence of vector pVP22/Myc-His 
(SEQIDNO:!). 

Figure 7 is a map of pVP22/Myc-His-TOPO® vector, which contains the T7 
promoter (T7), VP22 open reading frame (VP22), a multiple cloning site modified by 
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covalent coupling of the Vaccinia Virus Topoisomerase I protein (T) to linearized 
vector DNA, a myc epitope {myc\ and a polyhistidine tag (6xHis). A PCR product 
with a single 3' A base overhang can be inserted into the topoisomerase-adapted site. 

Figures 8A and B show the nucleotide sequence of pVP22/Myc-His-TOPO® 
5 vector (SEQ ID NO:2). 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there are provided method(s) for 
modulating a cellular process by contacting a cell in culture under suitable conditions 
with a cell process-modifying molecule attached to a translocating polypeptide, 
10 whereby the cell process-modifying molecule is translocated into the cell and interacts 
specifically therein with a target site responsive to the cell process-modifying 
molecule, thereby modulating a cellular process in the cell. 

As used herein, the term "translocating protein" means a protein, polypeptide, 
or functional fragment thereof, that crosses biological membranes. Translocating 

15 proteins, polypeptides, functional fragments and homologues thereof, possess the 
following properties: resistance to proteolysis, receptor-independent penetration of 
cell membranes, and substantially energy-free penetration of cell membranes. 
Exemplary translocating proteins that can be used in the invention methods and 
constructs include VP22 from Herpes Simplex Virus type 1 (G. Elliot and P. O'Hare, 

20 1 997, supra), a fragment of the Antennapedia protein from Drosophila (Antp) (amino 
acids 43 through 58) (5 ' -RQIKI WFQNRRMK WKK-3 * ) (SEQ ID NO:21) (Axcrona 
et ah, supra 1999), Protein H from Streptococcus pyogenes (D. Derossi et al, J. Biol 
Chem., 221:18188-93, (1996)), and the like. While each translocating protein has 
distinct properties, the general application of translocating proteins is to deliver other 

25 molecules to cells, either by constructing a fusion molecule (e.g., a fusion protein) or 
by attaching the desired molecule to the translocating protein (e.g. covalently or by 
means of a linker). In fusion proteins the translocating protein can be located either in 
the N-terminal or the C-terminal position. The preferred fusion protein or polypeptide 
for use in practice of the invention methods is a VP22 polypeptide. 
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The term "VP22 polypeptide" is used herein to refer to the herpes viral VP22 
protein, as well as to functional fragments thereof, that have the translocating 
properties of the intact protein. In addition, the term "VP22 polypeptide" as used 
herein encompasses homologues of VP22 protein, such as those derived from 
5 varicella zoster virus (VZV), equine herpesvirus (EHV), bovine herpesvirus (BHV), 
and the like, and transport-active (i.e. "functional") fragments, mutants and chimeric 
combinations thereof. 

In particular, VP22 polypeptide encompasses polypeptides corresponding to 
amino acids 60-301 and 159-301 of the full HSV1 VP22 sequence (1-301), whose 
10 sequence is disclosed in Figure 4 in WO 97/05265. Homologous proteins and 

fragments based on sequences of VP22 protein homologues from other herpes viruses 
are described in U.S. Patent 6,017,735, which is incorporated herein by reference in 
its entirety. 

The term "fusion protein" as used herein refers to two distinct proteins, 
1 5 polypeptides, peptides, and/or fragments not normally associated with each other in 
nature that are encoded by the same reading frame, resulting in the two or more 
distinct proteins and/or fragments being "fused" together. The fusion proteins used in 
invention methods are produced from nucleotide sequences encoding a translocating 
polypeptide, e.g., a VP 22 polypeptide, and another functional peptide in the same 
20 reading frame. The polynucleotide encoding the fusion protein may also contain in 
the same reading frame additional peptide or polypeptide sequences useful in the 
invention methods, such as epitope-tag encoding sequences, affinity purification-tag 
encoding sequences, additional functional protein encoding sequences, and the like, or 
a combination of any two or more thereof. 

25 In one embodiment, the invention provides method(s) for transfecting a cell 

with a target gene by contacting the cell under suitable conditions with a 
polynucleotide comprising the target gene attached to a translocating polypeptide, 
whereby the cell is transfected with the target gene. As used herein, the term 
"transfected" means that a gene translocated into a cell in culture due to the 
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translocating properties of an attached translocating polypeptide is expressed in the 
cell, at least transiently, i.e., the cell is transiently transfected with the target gene. 

The size of polynucleotide that can be transfected into a cell according to the 
invention methods ranges from about 10 nucleotides to about 10 kilobases (kb). For 
5 example, polynucleotides in the range from about 20 nucleotides (nt) to about 5 kb, or 
from about 100 to 500 nt can be transfected into cells using the invention methods. 
Generally, the target polynucleotide is transiently transfected into a cell population in 
culture, for example, in a monolayer or tissue culture. None of the conventional 
means used to assist transfection or transduction is required, such as electroporation, 

1 0 infection employing viral vectors, calcium phosphate transfection, dextran sulfate 

transfection, lipofection, cytofection, particle bead bombardment, and the like. Instead, 
all that is required is contact (i.e., co-culture) of the cell population to be transfected with 
purified translocation protein or with synthetically prepared translocating protein 
having a polynucleotide of interest attached thereto by means of a covalent bond or 

1 5 linker molecule, as described herein. Any type of prokaryotic or eukaryotic cell in 
culture can be transfected using invention methods, for example, mammalian, yeast, 
insect or plant cells. However, it is presently preferred that the cells in culture be a 
monolayer of mammalian or insect cells. 

In invention methods wherein a translocating protein is attached to plasmid 
20 DNA (i.e., via either covalent or non-covalent interactions), the DNA can be delivered 
to the nucleus for gene expression. Delivery of DNA using translocating proteins as 
described herein is an extremely valuable research tool. In up to 100% of the cells 
into which a desired polynucleotide containing an open reading frame (e.g., a 
polynucleotide contained in a plasmid) is delivered by an invention translocating 
25 protein, the polynucleotide is internalized, transported to the nucleus, and the open 
reading frame is then expressed, thus creating a homogeneous population of cells for 
studying such cell processes as cell cycle regulation, transcription regulation, 
translation regulation, and the like. 

In another embodiment according to the invention, method(s) are provided for 
30 modulating expression of a target gene product in a cell in culture that contains a 
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target gene under control of one or more regulatory elements. In this embodiment, the 
invention method is practiced by contacting the cell in culture under suitable 
conditions with one or more regulatory agents attached to a translocating polypeptide, 
whereby the one or more regulatory agents are translocated into the cell in culture and 
5 interact therein with the one or more regulatory elements, thereby modulating 
expression of the target gene product by the cell 

For example, a polynucleotide attached to a translocating polypeptide, such as 
VP22, can be translocated into the nucleus of the cell for expression of all or a part of 
the polynucleotide. In one embodiment, the polynucleotide comprises an open reading 
1 0 frame encoding a protein of interest, such as a target gene product or reporter gene 
product. Alternatively, the polynucleotide can be a vector (e.g., a supercoiled 
plasmid) containing a cloned open reading frame that encodes a target gene. 

It has been discovered that the translocating protein and attached cell process- 
modifying molecule can be directed to the cytoplasm for expression as well as to the 

1 5 cell nucleus of the population of cells in culture if the translocating protein is attached 
(e.g., fused) to a nuclear export signal (NES). Signals for the export of proteins from 
the nucleus have recently been described. Analysis of PKI (heat stable inhibitor of 
cAPK, cyclic AMP-dependent protein kinase A) (Y. Wang et al., Gene Therapy 4, 
432-441 (1997)) and the HIV Rev protein (W. Wen et al, Cell 32, 463-473 (1995)) 

20 has revealed a leucine rich sequence that is sufficient to direct heterologous sequences 
out of the nucleus and into the cytoplasm. Furthermore, fusion of the NES to a 
heterologous protein that includes the canonical SV40 larger T antigen NLS has been 
shown to result in the distribution of the protein between the cytoplasmic and nuclear 
compartments (Wang et al., supra). Similarly, the Rev protein contains sequences for 

25 both nuclear import and export and is found in both the cytoplasmic and nuclear 

compartments of cells (Wen, et al, supra). Thus, incorporation of a NES is a potential 
method to modulate the nuclear targeting of translocating proteins, such as VP22, 
especially since the PKI NES can partially counteract the very strong signals of the 
SV40 NLS. When attached to a nuclear export signal, the translocating polypeptide 

30 and any attached polynucleotide can be stably introduced into the cytoplasm as well 
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as the nucleus of the cells in culture, thereby accomplishing partition of the 
polynucleotide between cellular compartments. In the cytoplasm, regulation of 
expression of a gene contained in the polynucleotide can be regulated using invention 
methods as described herein. 

5 Nuclear export signals suitable for use in the practice of the invention are 

known in the art and include the nuclear export signals derived from the HIV Rev 
protein or the heat stable inhibitor of cAPK, and the like. In many cases, inclusion of 
a nuclear export signal into the translocation protein-containing construct can be used 
to stably integrate a target gene of interest into the genome of the cells in culture. 

1 o A cell in culture can be contacted with a translocating protein attached to a 

cell-modifying molecule according to the present invention by a variety of methods. 
In the one method, an expression cell population transfected with a polynucleotide 
encoding the translocating protein fused to the cell-modifying molecule (e.g., as a 
fusion protein) is mixed and co-cultured with a target cell population that 

1 5 spontaneously takes up the expressed translocation protein with attached cell- 
modifying molecule. The expressed protein accumulates in the cytoplasm of the 
transfected expression cells and, by translocating across cell membranes, spreads to 
the surrounding non-transfected cells where it accumulates in the nuclei. For 
example, the expression cell can be a prokaryotic cell line, such as E. coli, and the 

20 target cell line can be any eukaryotic cell line, for example a mammalian cell line, 
such as CHO or COS, or an insect cell line, such as Drosophila S2, and the like. 

Alternatively, the expression cell population can be cultured under conditions 
that promote expression of a transfected gene, a cell lysate can be prepared of the 
transfected expression cell population and the lysate can be applied to a cultured 

25 target cell population using methods known in the art and as described in the 

Examples herein. When the translocation protein is VP22, the VP22 or fusion protein 
containing VP22 will translocate to the nuclei of substantially 100% of the cell 
population. It is also possible to culture the target cells with purified translocation 
protein-containing molecules or with synthetically prepared molecules containing the 

30 translocating protein attached to a polypeptide or nucleotide by means of a covalent 
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bond or linker, as described herein. The translocating polypeptide and attached 
molecule will translocate to an entire cell population in culture within about 10 
minutes to about 72 hours, more typically within about 10 minutes to about 50 hours; 
preferably within about 10 minutes to about 24 hours. However, in some cases, no 
5 more than about 1 0 minutes is required for uptake of a translocating polypeptide and 
attached molecule by an entire cell population. 

Fusions of translocating polypeptides with known DNA binding proteins can 
also be used to deliver DNA containing an open reading frame (e.g., a plasmid) to 
tissue culture cells. In this embodiment of the invention methods, the DNA binding 

1 0 protein acts as a linker for attaching the translocating protein to the cell-modifying 
polynucleotide (i.e. a plasmid containing a polynucleotide that acts to modify a cell 
process). Examples of protein linkers that may be fused to translocating proteins for 
the delivery of polynucleotides, such as plasmid DNAs, include histone 1 (HI) protein 
(M. Wilke et al., supra and Niidome, et al., 1 Biol Chem. 222, 15307-15312 (1997)) 

15 and the non-histone protein HMG-17 (high mobility group 17) (S. V. Zaitsev, et al, 
Gene Ther. 4, 586-592 (1997)). HMG-17 interactions with DNA have been studied in 
depth and demonstrate that HMG-17 interacts with DNA in a non-cooperative, non- 
specific, and reversible manner (M. Bottger et al, Arch. Geschwulstforsch £Q, 265- 
270 (1990)). In each case, either the entire DNA binding protein, or a functional 

20 fragment thereof (i.e. a fragment having DNA binding activity) may be used. 

It may be preferred to complex the DNA with a reagent, such as 
polyethylenimine (PEI), that condenses and neutralizes the charged DNA prior to 
mixing with the translocating protein, or translocating protein-DNA binding domain 
fusion. 

25 Alternatively, if a shorter peptide linker is advantageous in the particular 

system used, the peptide linker may be fused to a translocating protein either as a 
chemically synthesized peptide or as a nucleotide encoding a fusion protein to be 
expressed in a prokaryote expression system. Examples of short peptide sequences 
that may be fused to a translocating protein either as a chemically synthesized peptide 

30 or as a fusion protein include polylysine sequences and sequences containing three or 
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more repeats of the peptide sequence LARL, for example, LARL-LARL-LARL (SEQ 
ID NO:3) (J. D. Fritz et al, Hum. Gene Then 2:1395-1404 (1996)). In some cases, 
from three to about 100 repeats of the LARL sequence may be used as a linking 
peptide as described herein; typically from 3 to about 50 repeats, with 3 up to about 
5 20 repeats being presently preferred. 

A preferred linker for attaching a translocating protein to a cell-modifying 
polynucleotide is the Vaccinia virus topoisomerase I protein, or a mutant form 
thereof, which allows the formation of stable topoisomerase I-DNA conjugates. 
Vaccinia DNA topoisomerase, a 314 aa virus-encoded eukaryotic type I 

1 0 topoisomerase (I), binds to duplex DNA and cleaves the phosphodiester backbone of 
one strand (S. Shuman and B. Moss, Proc. Natl. Acad. Sci. USA £4: 7478-7482 
(1987)). The enzyme exhibits a high level of sequence specificity, akin to that of a 
restriction endonuclease. Cleavage occurs at a consensus pentapyrimidine element 
¥-(CrT)CCTT-3* in the scissile strand (S. Cheng et al., Proc. Natl. Acad. Sci. USA 

15 21: 5695-5699 (1994); J.M. Clark, Nucleic Acids Res. 1£: 9677-9686 (1988) ; and 
S.G. Morham and S.J. Shuman, Biol Chem. 2£Z: 15984-15992 (1992)). In the 
cleavage reaction, bond energy is conserved via the formation of a covalent adduct 
between the 3' phosphate of the incised strand and a tyrosyl residue (Tyr-274) of the 
protein. Vaccinia topoisomerase can religate the covalently held strand across the 

20 same bond originally cleaved (as occurs during DNA relaxation) or it can religate to a 
heterologous acceptor DNA and thereby create a recombinant molecule. When 
attached to an invention translocating protein, the Vaccinia topoisomerase I linker will 
attach to a double stranded oligonucleotide having single 5' A base overhangs, such 
as are created in Taq mediated PCR. Such topoisomerase I-DNA conjugates may 

25 then be introduced into cells. 

Figure 7 illustrates a suitable vector wherein Vaccinia topoisomerase I linker 
is used to attach a translocating protein to a double-stranded oligonucletide of interest. 
Vector pVP22/Myc-His TOPO® (SEQ ID NO:2), utilizes Vaccinia topoisomerase I 
linker to attach VP22 to a double stranded PCR product (i.e., a cell-process modifying 
30 oligonucleotide) having single 5' A base overhangs to create a VP22 fusion with 
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vector DNA. Such topoisomerase I-DNA conjugates may then be introduced directly 
into cells. 

In another embodiment, a translocating protein is used to increase the 
efficiency of plasmid delivery in conjunction with a cationic liposome. Fusion of a 
5 translocating protein to a protein domain that readily associates with a cationic 
liposome, for example a hydrophobic transmembrane domain or a 
glycosylphosphatidylinositol (GPI) anchor, facilitates interaction at the lipid-DNA 
interface. Following endocytosis of the liposome-DNA complex, the translocating 
protein will translocate the complex through the endosomal membrane and into the 
10 cell cytoplasm and, eventually, to the nucleus for gene expression. Translocating 
proteins may also be used to enhance transfection efficiencies in conjunction with 
compounds, such as chloroquine, that inhibit lysosomal hydrolases (Niidome et al., /. 
Biol C*«».,2Z2:5307-12 > 1998). 

Polynucleotides encoding fusion proteins may be constructed by standard 
1 5 molecular biology techniques (J. Sambrook, E. F. Fritsch and T. Maniatis (1989). 
Molecular Cloning, A Laboratory Manual Cold Spring Harbor Laboratory Press. 
Cold Spring Harbor, NY), transfected into tissue culture cells and tested for 
translocation ability by use of suitable methods, e.g., immunofluorescence, as are 
known in the art. See also the methods discribed in the Examples herein. 

20 Inducible systems are used to study the phenotypic effects of protein 

expression. Since inducible systems allow expression of a protein on demand, such 
systems can be used as a research tool to study cell processes and even to enable the 
expression of toxic proteins in tissue culture. Current systems for inducible 
mammalian expression use transcriptional elements from diverse organisms, for 

25 example, E.coli (U. Fischer et al., Cell £2:475-483 (1995)), or Drosophila (M. Gossen 
et al., 7/5518:471-475 (1993)) that are constitutively expressed in a cell line along 
with a vector that contains a promoter responsive to transcriptional regulators. 
Addition of an effector molecule causes binding of the transcriptional regulators to the 
inducible promoter, thus turning on gene expression. 
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The present invention provides a novel approach to this problem by providing 
method(s) for modulating expression of a target gene product in a mammalian cell 
transfected with the target gene under control of one or more regulatory elements. In 
the invention method, the target cell is contacted under suitable conditions with one or 

5 more regulatory agents attached to a translocating polypeptide, whereby the one or 
more regulatory agents are translocated into the mammalian cell and interact therein 
with the one or more regulatory elements, thereby modulating expression of the target 
gene product by the cell. The translocating polypeptide used in invention methods for 
modulating expression of a target gene product can be any of the translocating 

1 0 polypeptides disclosed herein, but is preferably a VP22 polypeptide. 

The regulatory agent can be a polynucleotide, a protein or polypeptide, or a 
small molecule. For example, the regulatory element can be a promoter operatively 
linked to a target gene wherein translocation of the regulatory agent transactivates 
expression of the target gene product by the promoter. It is preferred that the 
1 5 regulatory agent be specific for the promoter, such as a polymerase specific for the 
promoter. 

An exemplary inducible system according to the present invention utilizes the 
RNA polymerase of bacteriophage T7, which has been used to direct gene expression 
in mammalian cells. Expression of T7 RNA polymerase (T7 RNAP) by Vaccinia 

20 virus (A. Ramsey-Ewing and B. Moss, J. Biol. Chem. 221:16962-16966, 1996; T. R. 
Fuerst et al., Proc. Natl Acad. Sci. £2:8122-8126, 1986) or in a stable cell line (O. 
Elroy-Stein and B. Moss, Proc. Natl. Acad. Sci. 52:6743-6747, 1990 and A. Lieber et 
al., Nucleic Acids Res. 12:8485-8493, 1989), or introduction of T7 RNAP protein at 
the time of transfection (X. Chen, et al., Cancer Gene Ther. 2:281-289, 1995 and X. 

25 Chen et al., Nucleic Acids Res. 22:21 14-2120, 1994) promotes specific expression of 
genes that are located downstream of the small T7 promoter. The specificity that T7 
RNAP has for the T7 promoter ensures that the desired gene is expressed and that 
non-specific gene activation does not occur. Expression using T7 RNAP has been 
reported to be very strong, 6-fold higher than the RSV promoter in one case (A. 

30 Lieber et al., Nucleic Acids Res. 12:8485-8493, 1 989). In addition, gene expression 
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can be directed by T7 polymerase either in the nucleus (Lieber, supra and J. J. Dunn 
et al., Gene 65:259-266, 1988) or the cytoplasm of cells. These characteristics 
suggested that T7 RNAP can be used to specifically regulate gene expression by the 
addition of a T7 RNAP/VP22 fusion protein to cells containing a T7 promoter 

5 construct. Direct delivery of T7 RNAP using VP22 technology allows specific 
control of gene expression and minimizes negative effects of delivery to non-target 
sites. Thus, the methods of the present invention allow for study of the phenotypic 
effects of protein expression on demand. Due to the specificity of the invention 
inducible system, the expression of toxic proteins can also be studied in tissue culture. 

10 "Toxic proteins," as the term is used herein, refers to proteins that have immediate 
intrinsic toxic potential for living systems, including those trans-dominant mutations 
in proteins leading to constituitively active forms of the protein. Thus toxic proteins 
are distinguished from "pro-drug" type molecules that require modification after 
expression to release a toxic potential. Non-limiting examples of toxic proteins that 

15 can be used in practice of the invention methods are various oncogene products, such 
as Raf and Ras oncogene products (Reviewed by Avruch et al. Trends in Biology 
12:279-83, 1994). 

Alternatively, the regulatory agent can be a transcription factor specific for the 
regulatory element so that translocation of the regulatory agent transactivates 

20 expression of the target gene product. For example, the translocating protein can be 
fused to a DNA binding domain, such as that from the Gal4 protein, and to a common 
transactivation domain, such as VP 16 or B42. In this embodiment of the invention, 
the translocating protein-containing fusion protein will localize to the nucleus and 
then specifically activate a promoter which contains upstream binding sites for the 

25 DNA binding domain incorporated into the fusion protein. 

"DNA-binding protein(s)" contemplated for use herein belong to the well-known 
class of proteins that are able to directly bind DNA and facilitate initiation or repression 
of transcription. Exemplary DNA-binding proteins contemplated for use herein include 
transcription control proteins (e.g., transcription factors and the like; see, for example, 
30 Conaway and Conaway, Transcription Mechanisms and Regulation, Raven Press Series 
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on Molecular and Cellular Biology, Vol. 3, Raven Press, Ltd., New York, NY, 1994; T. 
Boulikas, Critical Reviews in Eukaryotic Gene Expression+Ml&lYl 17-321, 1994; A. 
Klug, Gene 125:83-92, 1993; W. M Krajewska, Int. J. Biochem., 24:1885-1898, 1992.) 

Transcription factors contemplated for use herein as a source of such DNA 
5 binding domains include, e.g., homeobox proteins, zinc finger proteins, hormone 

receptors, helix-turn-helix proteins, helix-loop-helix proteins, basic-Zip proteins (bZip), 
P-ribbon factors, and the like. See, for example, S. Harrison, "A Structural Taxonomy of 
DNA-binding Domains," Nature, 352:715-719. Homeobox DNA-binding proteins 
suitable for use herein include, for example, HOX, STF-1 (Leonard et al, MoL Endo., 

10 2:1275-1283, 1993), Mat a-2, INV, and the like. See, also, Scott et al Biochem. 
Biophys. Acta, 2S2:25-48, 1989. It has been found that a fragment of 76 amino acids 
(corresponding to amino acids 140-215 described in Leonard et al, 1993) containing the 
STF-1 homeodomain binds DNA as tightly as wild-type STF-1. Suitable zinc finger 
DNA-binding proteins for use herein include Zif268, GLI, XFin, and the like. See also, 

1 5 Klug and Rhodes, Trends Biochem. Sci., 12:464, 1987; Jacobs and Michaels, New Biol, 
2:583, 1990; and Jacobs, EMBO J., 11:4507-4517, 1992. 

The DNA-binding domain(s) used in the invention methods can also be obtained 
from a member of the steroid/thyroid hormone nuclear receptor superfamily, or be 
substantially the same as those obtained from a member of the superfamily. The 

20 DNA-binding domains of substantially all members of the steroid/thyroid hormone 
nuclear receptor superfamily are related. Such domains consist of 66-68 amino acid 
residues, and possess about 20 invariant amino acid residues, including nine cysteines. 
Members of the superfamily are characterized as proteins which contain these 20 
invariant amino acid residues. The highly conserved amino acids of the DNA-binding 

25 domain of members of the superfamily are as follows: 
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Cys-X-X-Cys-X-X-Asp*-X-Ala*-X-Gly*- 
X-Tyr*-X-X-X-X-Cys-X-X-Cys-Lys*-X- 
Phe-Phe-X-Arg*-X-X-X-X-X-(X-X-)Cys- 
X-X-X-X-X-(X-X-X-)Cys-X-X-X-Lys-X- 
5 X-Arg-X-X-Cys-X-X-Cys-Arg*-X-X- 
Lys* - Cys - X - X - X - Gly* - Met (SEQ ID NO:4); 

wherein X designates non-conserved amino acids within the DNA-binding domain; an 
asterisk denotes the amino acid residues which are almost universally conserved, but for 
which variations have been found in some identified hormone receptors; and the residues 
1 0 enclosed in parenthesis are optional residues (thus, the DNA-binding domain is a 

minimum of 66 amino acids in length, but can contain several additional residues). Such 
DNA binding domains bind to 2-half site recognition sites, as is well known in the art to 
transactivate transcription under control of a response element comprising the 
recognition site. 

1 5 The GAL4 DNA binding domain does not interact with a 2-half site DNA 

recognition site. The DNA binding domain of the yeast GAL4 protein comprises at least 
the first 74 amino terminal amino acids thereof (see, for example, Keegan et al, Science 
221:699-704, 1986). Preferably, the first 90 or more amino terminal amino acids of the 
GAL4 protein will be used, for example, the 147 amino terminal amino acid residues of 

20 yeast GALA 

Another DNA binding domain that can be used in the practice of the present 
invention is the Tet operon. The tetracycline inducible system is well-known in the art 
(see, e.g, Gossen et al., Proc. Natl Acad. Set 32:5547-5551 (1992); Gossen et al., TIBS 
15:471-475 (1993); Furth et al, Proc. Natl Acad Sci. 21:9302-9306, (1994) ; and 
25 Shockett et al., Proc. Natl. Acad. Sci. 22:6522-6526 (1995)). 

Transcription modulating domains are of two types, those that activate 
transcription of a gene sequence operatively associated with a response element that is 
responsive to the invention system (i.e., transcription activation domains) and those 
that repress or de-activate transcription of a gene sequence operatively associated with 
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a response element that is responsive to the invention system (i.e., transcription 
repression domains). The ability of the invention system to activate transcription of 
such a target gene is generally enhanced when the transcription modulating domain 
attached to the translocating protein is a transcription activation domain. Transcription 
5 activation domains contemplated for use in the practice of the present invention can be 
obtained from a variety of sources and are well known in the art. 

Such transcription activation domains are typically derived from transcription 
factors and comprise a contiguous sequence that functions to activate gene expression 
when associated with a suitable DNA-binding domain. For example, suitable activation 

1 0 domains can be obtained from the N-terminal region of members of the steroid/thyroid 
hormone nuclear receptor superfamily, from transcription factor activation domains, 
such as, for example, VP16, GAL4, NF-kB or BP64 activation domains, and the like 
(See, for example, M. Manteuffel-Cymborowska, Acta Biochim Pol 4&l):77-89 (1999); 
T. Tagami et al, Biochem BiophysRes. Commun. 252{2):358-63 (1998), W. Westin, 

1 5 Adv Pharmacol, 47:89- 1 1 2 (2000)). The activation domain presently preferred for use 
in the practice of the present invention is obtained from the C-terminal region of the 
VP 16 protein. 

Transcription repressor domains that can be used in the invention methods 
include those that repress transactivation of gene expression. Exemplary transcription 
20 repressor domains suitable for use as the transcription modulating domain in the 

invention methods include RAFT, CREM, MECP-2, SMRT, NcoR, mSin3 A, RAR, TR, 
SMRTR, and the like. 

Another way in which translocating proteins may be used in inducible 
expression systems for modulating expression of a target gene is to create gene 

25 fusions or in vitro covalent linkage with site-specific recombination sequences, which 
are sequences of nucleic acids that are specifically recognized by a particular site- 
specific recombinase. Site specific recombinases, as the term is used herein, are 
enzymes that catalyze the excision and/or recombination of nucleic acid sequences, 
and may form intermediate complexes with the transfer sequence DNA during the 

30 recombination event. These enzymes recognize a relatively short, unique nucleic acid 
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sequence that serves as a site for both recognition and recombination. Recombinases 
particularly useful in the practice of the invention are those that function in a wide 
variety of cell types because such enzymes do not require any host specific factors 
and do not require ATP to function. 

5 Two major families of site-specific recombinases from bacteria and unicellular 

yeast have been described: the integrase family and the resolvase/invertase family. In 
these recombinases, strand exchange catalyzed by site specific recombinases occurs in 
two steps of (1) cleavage and (2) rejoining, involving a covalent protein-DNA 
intermediate formed between the recombinase enzyme and the DNA strand(s). The 

10 nature of the catalytic amino acid residue of the enzyme and the line of entry of the 
nucleophile is different for these two recombinase families. For cleavage catalyzed 
by the invertase/resolvase family, the nucleophile hydroxyl is derived from a serine 
and the leaving group is the 3' -OH of the deoxyribose. For the integrase family, the 
catalytic residue is a tyrosine and the leaving group is the 5' -OH. In both 

15 recombinase families, the rejoining step is the reverse of the cleavage step. 

The recombinase activity of Cre has been studied as a model system for the 
integrases. Cre is a 38 kD protein isolated from bacteriophage PI. It catalyzes 
recombination at a 34 base pair stretch of nucleic acids called loxP. The loxP site has 
the sequence V - A T A A CTTCGT AT A GC AT AC ATT AT ACGAAGTTAT-3 ' (SEQ ID 

20 NO: 5; spacer region underlined), consisting of two 13 base pair palindromic repeats 
flanking an eight basepair core sequence (Hoess et al, Proc. Natl. Acad Sci USA 
22:3398, 1982 and U. S. Patent No. 4,959,217, the disclosure of which is incorporated 
herein by reference in its entirety). The repeat sequences act as Cre binding sites with 
the crossover point occurring in the internal spacer core. Each repeat appears to bind 

25 one protein molecule wherein the DNA substrate (one strand) is cleaved and a 

protein-DNA intermediate is formed having a 3'-phosphotyrosine linkage between 
Cre and the cleaved DNA strand. Crystallography and other studies suggest that four 
proteins and two loxP sites (each on a different DNA molecule) form a synapsed 
structure in which the DNA resembles models of four-way Holliday-junction 

30 intermediates, followed by the exchange of a second set of strands to resolve the 
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intermediate into recombinant products (see, Guo, et aL Nature 222:40-46, 1997). 
The asymmetry of the core region of the loxP recombination sequence is responsible 
for directionality of the recombination reaction. When two loxP sites on the same 
DNA molecule are in a directly repeated orientation, Cre excises the DNA between 
5 these two sites, leaving a single loxP site on the DNA molecule (Abremski et al, Cell 
22:1301, 1983). Thus, the repeat sequences act as Cre-specific binding sites with the 
recombination crossover point occurring in the core. 

The loxP site is so complex in size that it occurs only in the PI phage genome. 
Therefore, use of the loxP sites in the invention methods assures that the enzyme will 

1 0 not cut the transfer sequence within the interior of the sequence unless the transfer 
sequence rs from the Pi phage genome. The activity of Cre in a wide variety of 
cellular backgrounds, including yeast, shows that Cre does not require host specific 
factors for activity (Sauer Mol Cell Biol. 7:2087-2096, 1987) in plant (Albert et al, 
Plant J. 2:649-659, 1995; Dale and Ow, Gene £1:79-85, 1990; Odell etal, Mol. Gen. 

1 5 Genet. 222:369-378, 1 990), or mammalian cells, including both rodent and human 
cells (van Deursen et al, Proc. Natl Acad Sci. USA 22:7376-7380, 1995; Agah et al, 
1 Clin. Invest. i£Q:169-179, 1997; Sauer and Henderson, New Biologist 2:441-449, 
1990). 

The Cre protein also recognizes a number of variant or mutant lox sites 
20 (variant relative to the loxP sequence), including the loxB, loxL and loxR sites, which 
are found in the E. coli chromosome. Other variant lox sites include loxP5 1 1 
(5 ' - AT AACTTCGTAT AQIAIACATTAT ACGAAGTT AT-3 * (SEQ ID NO:6; 
spacer region underlined); loxC2 

(5 '-AC AACTTCGTATAAIGIAIGCTATACGAAGTTAT-3 y (SEQ ID NO:7; 

25 spacer region underlined; U.S. Patent No. 4,959,21 7). Additional variants of the loxP 
site can be prepared by those of skill in the art and will generally have no more than a 
total of one to three point mutations in the two repeats that comprise the site-specific 
recombination sequence. Cre catalyzes the cleavage of the lox site within the spacer 
region and creates a six base-pair staggered cut. The two 13 bp inverted repeat 

30 domains of the lox site represent binding sites for the Cre protein. The two lox sites 
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may differ so long as Cre is able to recognize both lox sites. However, if two lox sites 
differ in their spacer regions in such a manner that the overhanging ends of the 
cleaved DNA cannot reanneal with one another, Cre cannot efficiently catalyze a 
recombination event using the two different lox sites. The efficiency of the 
5 recombination event will depend on the degree and the location of the variations in 
the binding sites. For example, the loxC2 site can be efficiently recombined with the 
loxP site because the two lox sites differ by a single nucleotide in the left-binding site. 
Thus, when Cre is the site specific recombinase used in the practice of the invention 
methods, the site-specific recombination sequence is a loxP site, or a variant thereof 
1 0 recognized by the Cre enzyme. 

A.recombinase of the integrase family with similar function is Flp, a 
recombinase identified in strains of Saccharomyces cerevisiae that contain 2p.-circle 
DNA. Flp recognizes a DNA sequence consisting of two 13 basepair inverted repeats 
flanking an 8 basepair core sequence 

15 (5 * -G AAGTTCCT ATTCICIAQAAAGT ATAGGAACTTC-3 ' (SEQ ID NO: 8); 
spacer underlined) called FRT(F\p Recombination Target site). A third repeat 
follows at the 3' end in the natural sequence, but does not appear to be required for 
recombinase activity. The Flp gene has been cloned and expressed in E coli and in 
mammalian cells (PCT International Patent Application PCT/US92/01899, 

20 Publication No: WO 92/1 5694, the disclosure of which is herein incorporated by 
reference) and has been purified (Meyer-Lean et aL, Nucleic Acids Res. 1£:6469, 
1987; Babineau et aL, J. Biol. Chem. 2£Q:12313, 1985; Gronostajski and Sadowski, J. 
Biol. Chem. 26Q:12328, 1985). 

Like Cre, Flp is functional in a wide variety of systems including bacteria 
25 (Huang et aL, J. Bacteriology 112:6076-6083, 1997), insects (Golic and Lindquist, 
Cell 52:499-509, 1989; Golic and Golic, Genetics 144:1693-171 1, 1996), plants 
(Lyznik et aL, Nucleic Acids Res 21:969-975, 1993) and mammals (U. S. Patent Nos. 
5,677,177 and 5,654,182), which shows the Flp does not require host specific factors 
for operability. 
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Additional integrases that can be used in practice of invention methods are 
retroviral integrases, including HTV and ASV integrases (Reviewed in Annu. Rev. 
Microbiol. 53:245-81, 1990). 

In practice of invention methods for modulating expression of a target gene 
5 produce, a site-specific DNA recombinase or integrase fused to a translocating protein 
may be introduced as described herein into cells that have been transfected with a 
plasmid containing a transcription-blocking sequence (e.g., a transcription termination 
sequence) flanked by recombinase recombination sites specific for the recombinase or 
integrase used and placed between a promoter and an open reading frame encoding a 
10 target gene. For example, if the recombinase is Flp, the recombinase sites are/rt sites, 
and if the recombinase is Cre, the recombinase sites are lox sites. Exposure of the 
transfected cell to the recombinase or integrase-adapted translocating protein results 
in removal of the transcriptional terminator by the activity of the recombinase and 
expression of the gene of interest as illustrated in Figure 1. 

1 5 Thus, in the invention methods for modulating a cellular process, the one or 

more regulatory elements can include a transcription-blocking sequence flanked by 
recombinase recombination sites and the regulatory agent can be a recombinase 
specific for the recombination sites, wherein translocation of the recombinase causes 
recombination of the recombination sites, thereby modulating expression of the target 

20 gene product. 

Alternatively, rather than placing a pair of recombinase sites flanking a 
polynucleotide segment to be excised, a single recombinase site can be incorporated 
into (or exist naturally in) the genome of the target cell that also contains a plasmid 
containing a target gene and a second recombinase site that pairs with the genomic 
25 recombinase site. When such a cell is contacted by a recombinase (e.g. integrase) 

specific for the recombinase site(s) in the target cell, translocation of the recombinase 
will trigger a recombination event such that the target gene will become stably 
incorporated into the genome of the target cell at the genomic recombinase site. 
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The recombinase or integrase may be introduced by mixing of two cell 
populations, one expressing the translocating protein-enzyme (e.g., Flp or Cre) fusion 
and the other containing the heterologous gene. Alternatively, the translocating 
protein-enzyme fusion may be produced in a prokaryotic or eukaryotic expression 
5 system, purified using known methods and as described herein, and applied to cells 
containing the heterologous gene. The cells may be either transiently transfected with 
the heterologous gene or carry it stably integrated in their genomes. 

Alternatively, the regulatory agent used in invention methods for modulating 
gene expression can be the HIV Rev protein paired with the Rev regulatory element 
1 0 as regulatory element (RRE). As illustrated in Example 5 herein, increasing amounts 
of Rev protein delivered to target cells containing RRE result in increased expression 
of an operatively linked target gene. 

In another embodiment of the invention method for modifying a cellular 
process, the protein molecule fused to a translocating protein is a Fv antibody 

1 5 fragment or a single chain antibody (sFv). Preferably polynucleotide encoding a 
fusion protein containing the translocating protein and sFv is introduced into cells in 
culture, as described herein, for translocation to the cell nucleus and intracellular 
expression. If the sFv is specific for an antigen target associated with intracellular 
machinery involved in a cellular function, for example, a target located within the cell 

20 nucleus, binding of the sFv to the intracellular target can interfere with cellular 

functions, such as Ras signaling (0. Elroy-Stein, T. R. Fuerst, B. Moss, Proc. Natl. 
Acad. Set 86, 6126-6130 (1989), membrane transport (O. Cachet, et al., Cancer 
Research 58, 1 170-1 176 (1998)), or viral replication (J. H. Richardson, J. G. 
Sodroski, T. A. Waldmann, W. A. Marasco, Proc. Natl Acad. Sci. 92, 3137-3141 

25 (1995)). Additional exemplary intracellular targets for which single chain antibodies 
can be constructed and used in the invention methods to modify cellular processes 
include human kinases, transcription factors, proteins controlling apoptosis, cell cycle 
regulators, oncoproteins, and the like. 

Therefore, the one or more regulatory agents used in the invention method(s) 
30 for regulating cell processes can include a Fv or sFv specific for a component of the 
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one or more regulatory elements within the cells in culture (whether the regulatory 
element is native or transfected into the cells), wherein translocation of the Fv or sFv 
into the cell by the translocating protein and binding of the antibody to the component 
modulates expression of the target gene product. 

For example, intracellular processes have been modified by creating a fusion 
protein containing an anti- ATF-2 sFv fused to VP22 and the transcriptional activation 
domain of VP 16 (Figures 2A-D). ATF-2 belongs to the bZIP family of transcription 
factors and controls gene expression via 8-bp ATF/CREB motifs, either as a 
homodimer or as a heterodimer — for instance, with Jun (S. Huguier et aL, Molecular 
and Cellular Biology i£:7020-7029, 1998). If the fusion protein is expressed within a 
reporter cell line that has ATF-2 bound upstream of a reporter gene, e.g., the bacterial 
luciferase gene, binding of the sFv in the fusion protein to the ATF-2 antigen in the 
cell nucleus (Figures 2A-D) triggers expression of the ATF-2 sFv-VP16 fusion 
(Figure 2B), but not a CREB sFv-VP 16 fusion (Figure 2C), resulting in expression of 
the reporter gene. This experiment demonstrates that the ATF-2 sFv is delivered to 
the cell nucleus where it binds the ATF-2 antigen. 

"Fv" as used herein means a genetically engineered fragment containing the 
variable region of the light chain and the variable region of the heavy chain expressed 
as two chains but chemically linked; "sFv" as used herein means a genetically 
20 engineered molecule containing the variable region of the light chain and the variable 
region of the heavy chain, linked by a suitable polypeptide linker as a genetically 
fused single chain molecule. 

The linkage of light chain and heavy chain variable regions in a Fv may be 
noncovalent, as described in Inbar et aL, Proc. Nat'l Acad. Sci. USA £2:2659-62, 
25 1 972. Alternatively, the variable chains can be linked by an intermolecular disulfide 
bond or cross-linked by chemicals such as glutaraldehyde. 

Exemplary linkers used to attach two segments of a Fv or to attach any other 
two proteins to (e.g., a translocating protein and a DNA binding protein) can be a 
Afunctional cleavable cross-linker, such as N-succinimidyl (4-iodoacetyl)- 
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aminobenzoate; sulfosuccinimydil (4-iodoacetyl)-aminobenzoate; 4-succinimidyl- 
oxycarbonyl-a-(2-pyridyldithio)toluene; sulfosuccinimidyl-6- [a-methyl-a- 
(pyridyldithiol)-toluamido] hexanoate; N-succinimidyl-3-(-2-pyridyldithio)- 
proprionate; succinimidyl 6[3(-(-2-pyridyldithio)-proprionamido] hexanoate; 
5 sulfosuccinimidyl 6[3(-(-2-pyridyldithio)-propionamido] hexanoate; 3-(2- 

pyridyldithio)-propionyl hydrazide, Ellman's reagent, dichlorotriazinic acid, S-(2- 
thiopyridyl)-L-cysteine, and the like. Further Afunctional linking compounds are 
disclosed in U.S. Patent Nos. 5,349,066. 5,618,528, 4,569,789, 4,952,394, and 
5,137,877, each of which is incorporated herein by reference in its entirety. 

10 These linkers can be attached to purified proteins using numerous protocols 

known in the art, such as those described in Examples 1 and 2 (see Pierce Chemicals 
"Solutions, Cross-linking of Proteins: Basic Concepts and Strategies;' Seminar #12, 
Rockford, IL). 

Preferably the antibodies used in the invention methods are sFv, comprising 
15 V H and V L chains connected by a peptide linker. These single-chain antigen binding 
proteins (sFv) are prepared by constructing a structural gene comprising DNA 
sequences encoding the V H and V L domains connected by an oligonucleotide. The 
structural gene is inserted into an expression vector, which is subsequently introduced 
into a host cell such as E. coli. The recombinant host cells synthesize a single 
20 polypeptide chain with a linker peptide bridging the two V domains. Methods for 
producing sFvs are described, for example, by Whitlow and Filpula, Methods, 2- 97- 
105, 1991; Bird et a/., Science 242:423-426, 1988; Pack et al, Bio/Technology 
11:1271-77, 1993; and Ladner et a/., U.S. patent No. 4,946,778, which is hereby 
incorporated by reference in its entirety. Such well known procedures can be 
25 modified to create fusion proteins comprising a sFv and a translocating protein, as 
described herein. 

For example, the linker in the sFv can be a peptide having from about 2 to 
about 60 amino acid residues, typically from about 5 to about 40, preferably from 
about 10 to about 30 amino acid residues. This alternative is particularly 
30 advantageous when the ligand moiety is proteinaceous. For example, the linker 
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moiety can be a flexible spacer amino acid sequence, such as those known in single- 
chain antibody research. Examples of such known linker moieties include GGGGS 
(SEQ ID NO:9), (GGGGS) n (SEQ. ID NO:10), GKSSGSGSESKS (SEQ ID NO:l 1), 
GSTSGSGKSSEGKG (SEQ. ID NO: 12), GSTSGSGKSSEGSGSTKG (SEQ ID 
5 NO:13), GSTSGSGKSSEGKG (SEQ ID NO:14), GSTS GSGKPGSGEGSTKG (SEQ 
ID NO: 15), EGKSSGSGSESKEF (SEQ ID NO: 16), SRSSG (SEQ. ID NO: 17), 
SGSSC (SEQ ID NO: 18), and the like. A Diphtheria toxin trypsin sensitive linker 
having the sequence MGRSGGGCAGNRVGSSLSCGGLNLQAM (SEQ ID NO: 19) 
is also useful. Alternatively, the peptide linker moiety can be VM or AM, or have the 

10 structure described by the formula: AM(G 2 t0 4S) x AM wherein X is an integer from 1 t 
to 1 1 (SEQ ID NO:20). Additional linking moieties are described, for example, in 
Huston et a/., PNAS £5:5879-5883, 1988; Whitlow, M., et al, Protein Engineering 
6:989-995, 1993; Newton et al, Biochemistry 21:545-553, 1996; A. J. Cumber et al, 
Bioconj. Chem. 2:397-401, 1992; Ladurner et al, J. Mol Biol 222:330-337, 1997; 

1 5 and U.S. Patent. No. 4,894,443, the latter of which is incorporated herein by reference 
in its entirety. 

It is contemplated to be within the scope of the present invention that the 
target gene within a cell in culture can be a reporter gene, such as is known in the art, 
for example, a non-endogenous gene encoding a detectable marker, such as the E. coli 
20 B-galactosidase gene, luciferase, or CAT. 

As an aid in purifying the fusion molecules or detecting expression triggered by 
use of the invention methods, it is often convenient to include in the polynucleotide that 
encodes the reporter gene an additional nucleotide sequence that encodes a protein 
tag, such as an antibody epitope (e.g., derived from Myc), a fluorescent peptide, or a 
25 poly His tag. 

A variety of methods can be used for attaching a peptide or oligonucleotide 
molecule to a translocating protein. For example, the translocating protein can be 
covalently conjugated to a translocating protein or to a polynucleotide encoding a 
translocating polypeptide for use in the invention methods using two low molecular 
30 weight chemical affinity ligands that can be attached to macromolecules like DNA 
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and to proteins and which combine to form a linker useful in preparation of fiision 
proteins or fusion genes used in the invention methods. Two such low molecular 
weight chemical affinity ligands are salicylhydroxamic acid (SHA) and phenylboronic 
acid (PBA), which quickly react to form a reversible pH-sensitive covalent bond 
5 (Figures 3 A-C), thus providing a convenient linker to attach a translocating protein to 
another protein or to a polynucleotide. For example, nucleic acid molecules 
containing PBA can be synthesized using PBA-NTPs (available from ProLinx, 
Seattle, Washington) or, if double stranded, can be labeled with PBA-ATP using the 
enzyme terminal transferase. SHA-NHS ester can also be used to attach SHA to 

10 lysine residues present in a translocating protein. In this embodiment, a PBA-adapted 
molecule and a SHA-adapted translocating protein are covalently linked and applied 
to cells. Alternatively, other linkers, such as disulfide bonds (which would be 
disrupted upon delivery to the cytoplasm) or Afunctional linker molecules (e.g., as 
disclosed herein) may also be used. Covalent linking using these or other linkers 

1 5 known in the art and disclosed herein provides a relatively stable attachment of the 
translocating protein to another molecule. 

In addition, a number of strong non-covalent molecular interactions can be 
used to generate translocating protein-containing complexes. For example, 
Strepavidin binds biotin very strongly (the disassociation constant is approximately 

20 10" 15 ). This strong affinity, which is routinely used to attach proteins to substrates, 
can be used to form a linker that attaches a cell process-modifying molecule to a 
translocating polypeptide. For example, a fusion protein containing VP22 
translocating protein and strepavidin may be generated and complexed with a 
biotinylated oligonucleotide to form a linker attaching a cell process-modifying 

25 polynucleotide to a translocating polypeptide. Strepavidin binds biotin as a tetramer 
(tetramer MW = 60,000 daitons) and VP22 is believed to act as a multimer, making 
this combination a suitable one. 



30 



Another polypeptide molecule that may be used as a linker to attach a cell 
process-modifying molecule to a translocating polypeptide for use in invention 
methods is the single stranded DNA binding protein (SSB) from E coli. Only 21 
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amino-acid residues (amino acid residues 2 through 22) of SSB appear to be involved 
in binding to ssDNA (i.e., "the functional fragment of SSB"). Furthermore, binding 
of SSB to ssDNA is not sequence specific. Therefore, a fusion protein containing a 
translocation protein fused to a functional fragment of SSB is an extremely attractive 

5 linker for attaching a translocating protein to an oligonucleotide or to plasmid DNA. 
Unlike the linking molecules described above (i.e., those containing PBA and SHA or 
strepavidin and biotin), which require modification of the oligonucleotide to be linked 
to the translocating protein, fusion proteins containing a SSB and a translocating 
protein can be attached to unmodified DNAs, thus providing time and cost-saving 

10 advantages. 

The invention will now be described in greater detail by reference to the 
following non-limiting examples. 

EXAMPLE 1 

Introduction of VP22 fusion protein into cells in culture by transfection 

The complete open reading frame (ORF) encoding the VP22 protein was 
cloned into the eukaryotic expression vector pcDNA3.1/myc-His (Invitrogen, San 
Diego, CA), to create the vector pVP22/A<yc-His (Figure 5; SEQ ID NO:l), in which 
the ORF of the fusion partner can be inserted into a multiple cloning site located 
between the VP22 ORF and sequences encoding the C-terminal Anti-myc epitope and 
a poly His tag. The anti-myc epitope provides for easy detection of recombinant 
protein with Anti-myc antibody, and the poly His tag is useful for purification. 
Alternatively, the vector used was modified by covalent coupling of the Vaccinia 
Virus Topoisomerase I protein to linearized vector DNA (e.g., pVP22 TOPO® TA 
Cloning® Kit (Invitrogen)). In this type of vector, the ORF of a gene product of 
interest (i.e., a "fusion partner") was cloned as a PCR product into the vector. An 
example of such a Topoisomerase-adapted vector encoding the VP22 polypeptide is 
pVP22/Myc-His TOPO® vector (Figure 7; SEQ ID NO:2). In either case, the 
plasmid containing the VP22 gene fusion was then transfected into cells in culture 
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In a typical transfection, COS or CHO cells were seeded into 6 well plates and 
grown to approximately 50% confluence prior to transfection. For each well, 5 
DNA was diluted into 1.5 ml OptiMEM medium (Gibco BRL, Chagrin Falls, OH) 
and mixed with 15 y\ Pfx-6 lipid (Invitrogen) for COS cells or 15 \il Pfx-7 lipid 
5 (Invitrogen) for CHO cells. Diluted DNA plus lipid was incubated with cells for 4 hr 
at 37 °C, then replaced with the appropriate medium and incubated for an additional 
40-48hrat37°C. 

Spreading of the VP22 fusion protein from the transfected cells to the 
surrounding untransfected cells was detected by immunofluorescence using an 

10 antibody against the myc epitope tag. In a typical immunofluorescence experiment, 
transfected cells in a single 35 mm well of a six well tissue culture plate were washed 
with phosphate buffered saline (PBS) and fixed by incubation in 2 ml of methanol for 
5min. Cells were washed five times with PBS (2 ml/wash), blocked for 15 min 
using PBS containing 10% fetal bovine serum (FBS), and then incubated for 20 min 

15 with an antibody against the myc epitope tag (Invitrogen) diluted at 1 :500 in 1 ml of 
PBS containing 10% FBS. For attachment of a fluorescent molecule to the antibody, 
cells were washed twice with PBS and incubated with a goat anti-mouse Oregon 
Green conjugate (Molecular Probes, Eugene, OR; cat # 0-6383) diluted 1 :500 in 1 ml 
of PBS containing 10% FBS for 20 minutes. After two additional washes, the 

20 antigemantibody complexes were observed using an Olympus IX-70 fluorescence 
microscope equipped with a fluorescein isothiocyanate (FITC) filter. 

Translocation of several VP22 fusion proteins prepared in this way, including 
those incorporating Aequorea victoria green fluorescent protein (hQGFP), lacZ, or the 
site specific recombinase Flp as the fusion partner, has been achieved by this method. 

25 EXAMPLE 2 

Transfection of cells with a gene fusion followed by mixing with untransfected 
cells 

To demonstrate how VP22 may be used to modulate expression of a functional 
gene product, a system for delivery of the site specific DNA recombinase Flp, was 
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developed. COS cells expressing a VP22-Flp recombinase fusion protein were 
prepared as described above and mixed with CHO cells that had been transfected with 
a reporter plasmid pFIN4//acZ (Figure 1). In the reporter plasmid, a segment of DNA 
that includes a transcriptional terminator, the Bovine Growth Hormone 
5 polyadenylation signal (Goodwin and Rottman, J. Biol Chem. 262:16330-4, 1992), is 
flanked by frt sites (recombination sites recognized by the recombinase Flp) to 
separate the CMV promoter and an otherwise operatively associated reporter gene 
encoding p-galactosidase as reporter. Cells transfected with pFIN4//acZ did not 
express P-galactosidase due to the presence of the transcriptional terminator placed 
1 0 between the frt sites. 

To illustrate that expression of the reporter gene could be controlled by 
translocating of the VP22-Flp recombinase fusion protein from one cell population to 
another, two populations of CHO cells were prepared, one transfected with plasmid 
that expresses the VP22-Flp recombinase fusion protein, and another transfected with 
1 5 plasmid that expresses a VP22-GFP fusion protein. Transfections were carried out as 
described above. Twenty-four hours after the end of the transfection, cells were 
recovered by trypsinization. Then the two cell lines were mixed and incubated for an 
additional 24 hr before staining for p-galactosidase activity. 

CHO cells transfected with pFIN4//acZ only expressed p-galactosidase when 
mixed with COS cells that express the VP22-Flp fusion. In the presence of Flp 
recombinase, the segment of DNA containing the transcriptional terminator was 
removed by recombination of the frt sites, and P-galactosidase was expressed. 
Incubation of the population of CHO cells transfected with plasmid that expresses a 
VP22-GFP fusion protein, but does not express a VP22-Flp fusion protein did not 
result in expression of p-galactosidase. 

This experiment shows that the VP22-Flp fusion protein translocates between 
different mammalian cell types and that functional Flp recombinase can be delivered 
to cells as the fusion partner in a VP22 fusion protein. 
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EXAMPLE 3 

Transfection of cells with a gene fusion followed by preparation of a cell free 
lysate from the transfected cells 

In these studies, a cell free lysate was prepared from cells transfected with 
5 pVP22//>ryc-His as follows: COS cells were grown to 50% confluence in a 100 mm 
dish (approximately 10 6 cells). Cells were transfected with 20 of pVP22/myc-His 
DNA using Pfx-6. Forty hours post-transfection, the cell monolayer was washed 
twice with PBS and then collected by scraping into 10 ml PBS. Cells were 
centrifuged at 500 g for 5 min and the PBS was aspirated from the cell pellet, which 
1 0 was then frozen on dry ice. Frozen cell pellets were stored at -80°C prior to 

preparation of lysates. The cell pellet was thawed on ice following addition of 0.5 ml 
ice cold lysis buffer (10 mM HEPES, pH 7.9, 400 mM NaCl, 0.1 mM ethylene 
diaminetetraacetic acid (EDTA), 0.5 mM dithiothreitol (DTT), 5% glycerol). The 
lysate was then vortexed briefly and centrifuged at 10000 X g for 5 min at 4°C. 

15 The entire supernatant was immediately added to 2 x 10^ cells in a 35 mm 

plate without removing the tissue culture media. After a 1 0 minute incubation at 
37°C, the media was removed and VP22/myc-His protein located in the nuclei of the 
cells was detected by immunofluorescence as described above. 

An alternative method for the detection of VP22 fusion protein uptake in 
mammalian cells from a cell free lysate prepared from cells that express the fusion 
protein utilizes Western blot. In a typical Western blot experiment, HeLa, COS or 
CHO cells were plated at 50% confluence in 60 mm dishes. Following application of 
the lysate, the cells were washed once with PBS and then with PBS containing 500 
mM NaCl to remove protein non-specifically bound to the outside of the cell. The 
cells were treated with trypsin for about 5 minutes to disassociate them from the plate 
and to digest any remaining extracellular peptide. The cells were solubilized and the 
proteins separated on a 4-20% Glycine gel (Invitrogen, Carlsbad, CA). The separated 
proteins were then transferred to nitrocellulose and probed with the appropriate 
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antibody conjugated to horseradish peroxidase (HRP). The VP22 fusion proteins 
were then detected using chemiluminescence. 

Thus, the VP22/myc-His protein contained in the lysates of cells transfected 
with pVP22/myc-His translocated to the nuclei of all untransfected cells within 10 
5 minutes of contact. This finding shows that lysates containing VP22 are useful for the 
delivery of protein sequences into cell types without the need for transfection of the 
receptor cell population. 

EXAMPLE 4 

Expression of a VP22 fusion protein in E.coli. followed by application of purified 
10 protein to cells in culture 

The vector pCRT7/VP22-l was developed to allow expression and 
purification of VP22 fusion proteins from E. coll This vector utilizes a C-terminal 
fragment of the VP22 protein (amino acids 159-301), which has proven sufficient for 
translocation of VP22 fusion proteins across cell membranes. Using the above 

1 5 described methods, VP22 fusion proteins were prepared containing various proteins 
as the fusion partner (including the HIV Rev protein and human protein rhoA), and 
the fusion proteins were expressed and purified . Activity of each fusion partner was 
demonstrated following uptake by cells in culture. To demonstrate the high efficiency 
with which translocation occurs in cell cultures, even when the cells transfect poorly 

20 using conventional techniques, uptake of a VP22/GFP fusion proteins by Jurkat 
T-cells and PC 12 cells, which are known to be refractory to standard transfection 
protocols, was also performed. These experiments show that VP22 fusion proteins 
can be purified and then delivered to substantially every cell in a cultured mammalian 
cell population, completely eliminating the need for transfection, even when the cell 

25 line is known to be refractory to standard transfection protocols. 

pCRT7/VP22-l is derived from the pET9b vector backbone (Novagen, 
Madison, WI). In preparation of pCRT7/VP22-l, the sequence encoding the C- 
terminal region of VP22 sufficient for translocation activity (amino acids 159 -301), a 
fragment containing a multiple cloning site and myc and His tags from the vector 
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pVP22/myc-His were inserted into the pET9b vector backbone. The multiple cloning 
site of pCRT7/VP22-l was derived from that of pVP22/myc-His. The 
pCRT7/VP22-l vector was prepared for coupling to Vaccinia Topoisomerase I in 
exactly the same way as in preparation of the pVP22/myc-His-TOPO® plasmid, as 
5 described above. Therefore, in this vector, the sequence encoding the ORF of a fusion 
partner can be either inserted into one of the multiple cloning sites or cloned as a PCR 
product into the topoisomerase cloning site in a way similar to that used with 
pVP22/myc-His or pVP22/myc-His-TOPO® plasmid. 

In a typical experiment a VP22 fusion protein was expressed as follows. Ten 
10 ng pCRT7/VP22-l DNA was transformed into 50 \i\ BL21(DE3)plysS cells. The 
transformed cells were incubated at 37°C for 1 hour in 200 ^1 SOC medium, which 
was then diluted to 2 ml with Luria-Burtoni (LB) medium plus 50 fig/ml kanamycin 
and allowed to grow overnight at 37°C. The 2 ml culture was used to inoculate 50 ml 
LB medium containing 50 ng/ml kanamycin. Cells were allowed to grow until an 
15 optical density of 0.5 - 0.6 was attained and then allowed to continue growth at either 
37°C, or shifted to room temperature (approximately 25°C) for 30 min. One ml of 
culture was removed and allowed to continue growing. Isopropyl-P-D- 
thiogalactopyranoside (IPTG) was added to the remaining culture to a final 
concentration of 1 mM. Cells were allowed to grow for an additional 4 hours and 
20 then gel samples were prepared from induced (plus IPTG) and non-induced cultures. 
200 nl of each culture were removed, cells were recovered by centrifugation, and the 
pellets raised in 50 fxl 1 X SDS/PAGE sample buffer. Alternatively, cells were 
recovered from the remainder of the culture by centrifugation and the cell pellets 
stored at -80°C. 

25 The VP22 fusion protein was purified as follows: The cell pellet was thawed 

on ice and resuspended in 4 ml ice cold lysis buffer (50 mM Sodium Phosphate pH 
8.0, 300 mM NaCl, 5 mM imidazole). The following were added to the lysis buffer 
immediately before cell lysis: P-mercaptoethanol to 5 mM, a-toluenesulfonyl fluoride 
(PMSF) to 0.5 mM, leupeptin and pepstatin to 1 ^g/ml each, and lysozyme to 

30 1 mg/ml. The lysate was incubated on ice for 20 to 30 min and then sonicated for 
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3 x 10 sec while on ice. DNase and RNase were added to final concentrations of 
10 ^ig/ml each. The lysate was left on ice for an additional 20 min, then drawn 
through a 21 gauge needle three times, and centrifuged at 20000 g for 15 min. 
Following centrifugation, a gel sample was prepared from the soluble supernatant. 
5 The supernatant was applied to a column containing 1 ml Probond resin equilibrated 
with lysis buffer (Probond beads interact with proteins tagged with 6 histidine arrays). 
The resin and supernatant were mixed in the column on ice for one to two hours. The 
column was then clamped vertically and the resin was allowed to settle. 

A sample of the supernatant was removed to test for the presence of unbound 
protein by SDS/PAGE. The resin was washed by allowing 10 ml lysis buffer (50 mM 
Sodium Phosphate pH 8.0, 300 mM NaCl, 5 mM imidazole) to pass through the 
column. The lysis buffer was collected and a gel sample was removed. The column 
was then washed with 20 ml wash buffer (50 mM Sodium Phosphate pH 8.0, 300 mM 
NaCl, 40 mM imidazole, 10 % glycerol) and another gel sample was prepared in the 
same way. Protein was eluted by addition of buffers having increasing concentrations 
of imidazole (wash buffer with either 100 mM, 200 mM or 500 mM imidazole). 3 ml 
of each buffer was applied, and 3 ml of each of the 100 mM and 200 mM imidazole 
elutions were collected. The 500 mM imidazole elution was collected as 0.5 ml 
fractions. A gel sample was also prepared from 10 ^g of the resin after elution to 
determine if the protein remained bound. All samples were examined on 4-20% 
SDS/PAGE gels (Novex) followed by Coomassie Staining or Western blot using an 
anti-myc-HRP conjugated antibody (Invitrogen) at 1:2000 dilution. Purified proteins 
were stored at 4°C for immediate use, or frozen at -80°C for storage. 

Uptake of VP22 fusion proteins was detected by immunofluorescence. Cells were 
25 grown to approximately 50% confluence in 35 mm wells. The medium was then 

removed and replaced with 1 ml of serum free medium. Approximately 10 jig of the 
purified VP22 fusion protein, eluted in wash buffer containing 500 mM imidazole, 
was added directly to the serum-free medium. Cells were incubated at 37°C for 20 
min and then washed with 3 x 2 ml PBS. Cells were then fixed and permeablized in 
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methanol for 5 min and prepared for immunofluorscence as described previously (see 
Invitrogen pVP22/myc-His Vector, cat no. V484-1). 

Alternatively, uptake of VP22 fusion proteins was detected by Western blot. 
This technique was used to detect accumulation of VP22 fusion protein in the nuclei 

5 of PC 12 cells and Jurkat T-cells. Suspensions of approximately 5 x 10 5 cells of each 
type were transferred to 15 ml Falcon tubes. Cells were recovered by centrifugation 
at 500 g for 5 min and then resuspended in 10 ml PBS. Cells were washed again in 
the same way, then resuspended in 1 ml serum-free medium containing approximately 
10 ^g of the VP22/GFP fusion protein and incubated at 37°C for 15 min. Following 

10 the incubation, cells were washed twice by centrifugation and resuspended in 10 ml 
PBS as before described. Cells were recovered by centrifugation again, raised in 
100 pj ice cold lysis buffer (10 mM HEPES-KOH, pH 7.9, 1.5 raM MgCl 2 , 10 mM, 
KC1, 0.5 mM dithio threitol (DTT), 1% Triton X-100), and incubated on ice for 10 
min. The lysate was centrifuged at 10000 g for 10 min. The supernatant, containing 

15 soluble cytoplasmic proteins, was removed, and 4 X protein sample buffer was added 
to the supernatant. The pellet, containing cell nuclei, was resuspended in 100 ^1 1 X 
protein sample buffer. Samples were run on 4-20% SDS/PAGE gels (Novex) and 
transferred to nitrocellulose membrane. Western blots were probed with an anti-myc- 
HRP antibody conjugate (Invitrogen). 

20 EXAMPLE 5 

Activity of a VP22 fusion protein in recipient cells: functional testing of a 
VP22/Rev fusion protein 

The HIV Rev protein is encoded by HTV genomic RNA and is responsible for 
regulation of RNA splicing. The Rev protein can bind to transcripts that contain a 
25 Rev Response Element (RRE), allowing export of the transcript from the nucleus and 
subsequent translation (reviewed in V. W. Pollard et al., Ann. Rev. Micobiol £2:491- 
532, 1998). In the absence of Rev, transcripts that contain RRE will complex with the 
HTV spliceosome, but are not spliced. Instead, they remain in the nucleus and are 
degraded. 
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In the following experiment, the binding of Rev to the RRE in a transcript was 
used to activate expression of a reporter gene. A reporter plasmid (pCAT/RRE) was 
prepared that contains a CMV promoter and a CAT reporter gene separated by a 
splice donor site. The RUE was located on the 3' side of the CAT gene site. 
5 Therefore, expression of CAT in response to Rev can be used to demonstrate the 
activity of Rev in the VP22/Rev fusion protein. 

CHO cells were transfected with pCAT/RRE and then treated with either 
VP22/Rev fusion protein or VP22/myc-His control fusion protein. Expression of the 
CAT reporter gene was examined by Western blotting of protein samples prepared 

10 from treated cells, using an antibody against the CAT protein. A sample was also 
prepared from cells transfected with a CMV-CAT positive control plasmid that does 
not contain the RRE. Expression of the positive control could be detected. When 
cells were transfected with pCAT/RRE and then treated with VP22/myc-His control 
fusion protein, no expression of CAT could be detected. However, when cells 

1 5 transfected with pCAT/RRE were treated with VP22/Rev fusion protein, expression 
of CAT could be detected. When a five-fold larger amount of VP22/Rev protein was 
added, an apparent increase in the level of CAT protein was detected. These results 
show that VP22 can deliver functional Rev protein to the nucleus and lead to 
expression of a reporter gene. 

20 In HIV infected cells, the Rev protein can shuttle between the cytoplasm and 

nucleus. The distribution of Rev between these intracellular compartments is 
dependent on a nuclear export signal present in the protein. To determine whether the 
nuclear export signal functioned in the invention VP22/Rev fusion protein, the 
distribution of VP22/Rev protein was examined by immunofluorescence using an 

25 antibody against the myc epitope tag, as described above. By this procedure, 

VP22/Rev fusion protein was detected in the cytoplasm and nuclei of cells, showing 
that fusion of Rev to VP22 appears not to interfere with the ability of Rev to be 
distributed to either the cytoplasm or the nucleus. 
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EXAMPLE 6 

Delivery of one or more molecules into cells in order to modify cellular processes 

The following experiment demonstrates how a cellular process may be 
modified in a cell by delivery to the cell of a VP22 fusion protein that contains as the 
5 fusion partner the small GTPase, rhoA, which is involved in the polymerization of 
actin microfilaments in mammalian cells. Previous studies have shown that when 
Swiss 3T3 cells are starved of serum for 16 hr, actin microfilaments involved in 
maintaining the shape of cells depolymerize, leaving soluble actin monomers. 
Addition of serum causes rapid repolymerization of actin, restoring the 
1 0 microfilaments. This effect has been produced by microinjection of cells with 
activated rhoA protein that has been expressed and purified from E. coli (A. Hall, 
Science 222:509-514, 1998). 

To test whether a VP22-rhoA fusion protein could generate a similar effect, a 
VP22-rhoA fusion protein was expressed and purified from E. coli using 

1 5 pCRT7/VP22- 1 -TOPO® plasmid. Swiss 3T3 cells were treated with the purified 

protein as follows: 3T3 cells were grown to approximately 50% confluence in 35 mm 
wells. Then the medium was removed and replaced with 1 ml of serum free medium. 
Cells were incubated for an additional 20 hr at 37°C. Approximately 1 jag of either 
purified VP22/rhoA or VP22/myc-His fusion protein was then applied to the cells. 

20 Twenty minutes later, cells were washed with 3 x 2 ml PBS, then fixed for 5 min in 
4 % formaldehyde (from Invitrogen p-galactosidase Staining Kit). Cells were washed 
again with 3 x 2 ml phosphate buffered saline (PBS), permeablized for 5 min with 
0.1% Tween-20® detergent in PBS, then washed again with 2 x 2 ml PBS. Cells 
were blocked for 30 min with 10% fetal bovine serum (FBS) in PBS before 

25 incubation with 0.1 jig/ml final concentration of FITC conjugated phalloidin (Sigma 
P-5282) in PBS/10% FBS for 30 min. 

Phalloidin binds to polymerized actin more strongly than to depolymerized 
actin, thus allowing for visualization of repolymerized microfilaments. Cells were 
washed again with 2 x 2 ml PBS prior to observation with an Olympus fluorescence 
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microscope and FITC filter. The purified fusion protein was applied to serum-starved 
3T3 cells. In cells that had been serum-starved and then treated with VP22/myc-His 
control fusion protein, no actin microfilaments could be detected and the cells 
appeared similar to serum-starved cells that had not been treated with either fusion 
5 protein. By contrast, in cells that had been treated with VP22-rhoA fusion protein for 
20 minutes, actin microfilaments could be clearly detected by binding of phalloidin. 
The distribution of actin microfilaments in cells that had been treated with VP22-rhoA 
fusion protein appeared similar to that seen in cells that had neither been treated with 
a fusion protein nor serum-starved. These results indicate that VP22 can be used to 
1 0 deliver a functional rho A fusion protein to cells. 

The wild type rhoA protein appears to stimulate polymerization of actin 
microfilaments from the cell membrane, but VP22 protein is normally transported to 
the cell nucleus. Since VP22/rhoA could stimulate the polymerization of actin 
microfilaments in a similar way, the distribution of VP22/rhoA protein was examined 
1 5 by immunofluorescence using an antibody against the myc epitope tag of the protein 
(Invitrogen). Most of the VP22/rhoA fusion protein could be detected in the 
cytoplasm of recipient cells and very little protein appeared to reach the nuclei. These 
studies show that VP22/rhoA protein may be retained at the sites of rhoA activity and 
not completely translocated to the nucleus. 

20 EX AM PLE 7 

Delivery of a VP22 fusion protein to a specific cellular compartment by 
modification of VP22. 

The following experiment demonstrates use of VP22 fusion protein to regulate 
distribution of the fusion partner within a specific cellular compartment. The HIV 
25 Rev protein (C. M Troy et al., Neuroscience i£:253-61, 1996) contains a leucine rich 
sequence that is sufficient to direct heterologous sequences out of the nucleus and into 
the cytoplasm. Furthermore, it has been shown that fusion of the Nuclear Export 
Signal (NES) to a heterologous protein that includes the canonical SV40 larger T 
antigen Nuclear Localization Signal results in distribution of the protein between the 
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cytoplasmic and nuclear compartments (W. Wen et al, Cell &2:463-473, 1995). 
Similarly, the Rev protein contains sequences for both nuclear import and export and 
is found in both the cytoplasmic and nuclear compartments (U. Fischer et al., Cell 
3£475-483, 1995). 

5 To test the ability of a translocating protein to deliver a fusion partner to a cell 

location other than the cell nucleus, in the present experiment, a fusion protein that 
consists of VP22/myc-His with the eleven amino-acid Rev NES inserted between the 
VP22 ORF and the myc epitope tag was expressed in E. coli, purified as described 
above, and applied to cells in culture. 

10 Distribution of the fusion protein among the cellular compartments in the cells 

in culture was examined by immunofluorescence as described above. The distribution 
of the fusion protein was verified by western blot analysis of treated cells, as follows: 
A suspension of 5 x 10 5 cells was transferred to 15 ml Falcon tubes. Cells were . 
recovered by centrifugation at 500 g for 5 min and then resuspended in 10 ml PBS. 

15 Cells were washed again in the same way, then resuspended in 1 ml serum- free 

medium containing approximately 10 ng VP22/GFP fusion protein, and incubated at 
37°C for 15 min. Following the incubation, cells were washed twice by 
centrifugation and resuspension in 10 ml PBS as before. Cells were recovered by 
centrifugation again and raised in 100 \i\ ice cold lysis buffer (10 mM HEPES-KOH, 

20 pH 7.9, 1.5 mM MgCl 2 , 10 mM, KC1, 0.5 mM DTT, 1% Triton X-100® detergent) 
and incubated on ice for 10 min. The lysate was centrifuged at 10000 g for 10 min. 
The supernatant, containing soluble cytoplasmic proteins, was removed and 
supplemented with 4 X protein sample buffer. The pellet, containing cell nuclei, was 
resuspended in 100 pi 1 X protein sample buffer. Samples were run on 4-20% 

25 SDS/PAGE gels (Novex) and transferred to nitrocellulose membrane. Western blots 
were probed with an anti-myc-HRP antibody conjugate (Invitrogen). These tests 
show that Rev NES adapted-VP22-containing fusion protein can distribute into the 
cytoplasm and nuclei of treated cells. 
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EXAMPLE 8 

Use of a VP22 fusion protein as a component of an inducible gene expression 
system. 

To test the theory that a translocating protein can be used in an inducible gene 
5 expression system with great specificity, a T7 RNAP/VP22 fusion protein was 

expressed and purified from E. coli using a protocol similar that described above for 
other VP22 fusion proteins. RNA polymerase activity was examined in an in vitro 
transcription assay. All reagents were from an in vitro transcription kit (Ambion, 
Austin, TX), and were used according to the manufacturer's instructions. The amount 
10 of RNA produced by the presence of the T7 RNAP/VP22 fusion protein was found to 
be similar to that of the T7 RNAP included in the kit. 

A reporter construct that contains a luciferase gene driven by a T7 promoter 
was also constructed. This construct was transfected into COS cells and 24 hours 
later purified T7 RNAP/VP22 fusion protein was applied to the cells. After an 

15 additional 24 hours, cell lysates were prepared and examined for luciferase enzyme 
activity using a luciferase assay kit (Promega) according to the manufacturer's 
instructions. Addition of T7 RNAP/VP22 fusion protein to cells transfected with the 
reporter gene resulted in five- to ten-fold increase above background in the level of 
luciferase expression, indicating that this system functions to control the expression of 

20 heterologous genes in eukaiyotic cells. 

EXAMPLE 9 

Covalent and non-covalent coupling to translocating proteins. 

Peptide or oligonucleotide molecules may be covalently conjugated to 
translocating proteins using Linx® chemical affinity system (Invitrogen) which uses 
25 low molecular weight chemical affinity ligands salicylhydroxamic acid (SHA) and 

phenylboronic acid (PBA). In this system, the low molecular weight chemical affinity 
ligands are used to form a Afunctional linker that attaches the translocating protein to 
a polynucleotide by means of a reversible pH-sensitive covalent bond. 
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Nucleic acid molecules containing PBA can be synthesized using PB A-NTPs 
(ProLinx, Seattle, WA) or, if double stranded, labeled with PBA- ATP using the 
enzyme terminal transferase. SHA-NHS ester can be used to attach SHA to lysine 
residues present in translocating proteins (Figures 3 A-C). The PBA-adapted molecule 
5 and the SHA-adapted translocating protein are then covalently linked and 

administered to cells. A full description of the procedures and conditions used to link 
proteins using this system is publicly available (Linx™ Rapid Protein Conjugation 
Kit, Catalog Nos K8050-01 to K8060-01, Invitrogen, San Diego, CA). 

EXAMPLE 10 

1 0 Assay for uptake of translocating protein: oligonucleotide conjugates 

A translocating protein and oligonucleotides of varying lengths can be 
conjugated and added exogenously to mammalian tissue culture cells. Single stranded 
DNA (ssDNA) of varying lengths containing PBA- ATP can be synthesized using 
PGR. A biotinylated 5' primer can be designed to allow purification of single- 

1 5 stranded molecules containing the PBA- ATP on a strepavidin column. A series of 3' 
reverse primers can be generated to facilitate the synthesis of a number of ssDNA 
molecules between 20 and 2000 nucleotides in length. The purified ssDNA 
molecules containing PBA will then be mixed with the translocating protein-SHA. 
Different concentrations of the protein: oligonucleotide conjugate can then added to 

20 cells and allowed to incubate for up to 4 hours. After incubation, the cells can be 

washed, fixed, and then probed using a strepavidin-FITC conjugate. Any internalized 
oligonucleotides will bind the strepavidin-FITC and be detected by fluorescence. It is 
expected that short oligonucleotides will be internalized very efficiently {i.e. 
delivered to 100% of the cells) and be concentrated within the nucleus. 

25 While the invention has been described in detail with reference to certain 

preferred embodiments thereof, it will be understood that modifications and variations 
are within the spirit and scope of that which is described and claimed. 
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WHAT IS CLAIMED IS: 

1 . A method for modulating a cellular process, said method comprising 
contacting a cell in culture under suitable conditions with a cell process-modifying 
molecule attached to a translocating polypeptide, whereby the cell process-modifying 
molecule is translocated into the cell in culture and interacts specifically therein with 
a target site responsive to the cell process-modifying molecule, thereby modulating a 
cellular process in the cell in culture. 

2. A method for transfecting a cell in culture with a target gene, said method 
comprising contacting the cell in culture under suitable conditions with a 
polynucleotide comprising the target gene attached to a translocating polypeptide, 
whereby the cell in culture is transfected by the target gene. 

3. The method according to claim 2 wherein the translocating polypeptide is a 
VP22 polypeptide, Antp, or Protein H. 

4. The method according to claim 2 wherein the translocating polypeptide is a 
VP22 polypeptide and the polynucleotide is translocated into the nucleus of the cell in 
culture. 

5. The method according to claim 2 wherein the polynucleotide is linear or 
circular DNA containing a cloned open reading frame that encodes the target gene. 

6. The method according to claim 5 wherein the polynucleotide is a supercoiled 
plasmid. 

7. The method according to claim 2 wherein the translocating polypeptide is 
attached to a DNA binding protein and the DNA binding protein links the 
translocating polypeptide to the polynucleotide. 
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8. The method according to claim 7 wherein the DNA binding protein is histone 
1 protein, high mobility group 17 protein (HMG17), a polylysine sequence, or an 
oligopeptide having at least three LARL repeats therein. 

9. The method according to claim 2 wherein the translocating polypeptide is 
attached to a nuclear export signal and the polynucleotide is transfected into the 
cytoplasm as well as the nucleus of the cell in culture. 

10. The method according to claim 9 wherein the nuclear export signal is derived 
from the HIV Rev protein or the heat stable inhibitor of cAPK. 

1 1 . The method according to claim 2 wherein the target gene is stably integrated 
into the genome of the cell in culture. 

1 2. A method for modulating expression of a target gene product in a cell in 
culture that contains a target gene under control of one or more regulatory elements, 

said method comprising contacting the cell in culture under suitable conditions 
with one or more regulatory agents attached to a translocating polypeptide, whereby 
the one or more regulatory agents are translocated into the cell in culture and interact 
therein with the one or more regulatory elements, thereby modulating expression of 
the target gene product by the cell. 

13. The method according to claim 12 wherein the cell in culture is a mammalian, 
yeast, insect or plant cell. 

14. The method according to claim 1 2 wherein the translocating polypeptide has 
the properties of: 

resistance to proteolysis, 

receptor-independent penetration of cell membranes, and 
energy- free penetration of cell membranes. 
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15. The method according to claim 12 wherein the translocating polypeptide is a 
VP22 polypeptide, Antp, or Protein H. 

16. The method according to claim 12 wherein the translocating polypeptide is a 
VP22 polypeptide. 

1 7. The method according to claim 12 wherein the regulatory agent is a 
polynucleotide, a protein or polypeptide, or a small molecule. 

1 8. The method according to claim 12 wherein the cell in culture is transfected 
with a polynucleotide comprising the target gene. 

19. The method according to claim 14 wherein the regulatory element is a 
promoter and translocation of the regulatory agent transactivates expression of the 
target gene product by the promoter. 

20. The method according to claim 19 wherein the regulatory agent is specific for 
the promoter. 

21 . The method according to claim 20 wherein the regulatory agent is a 
polymerase specific for the promoter. 

22. The method according to claim 2 1 wherein the polymerase is T7 RNA 
polymerase and the promoter is a T7 promoter. 

23. The method according to claim 12 wherein the regulatory agent is an HIV Rev 
protein and the regulatory element is the HIV Rev response element (RRE). 

24. The method according to claim 12 wherein the regulatory agent is a 
transcription factor specific for the regulatory element and translocation of the 
regulatory agent transactivates expression of the target gene product. 
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25. The method according to claim 12 wherein the regulatory agent and the 
translocating polypeptide are covalently attached. 

26. The method according to claim 12 wherein the regulatory agent and the 
translocating polypeptide are attached by a linker. 

27. The method according to claim 26 wherein the linker comprises one or more 
disulfide bonds, salicylhydroxamic acid (SHA), phenylboronic acid (PBA), a SHA- 
NHS ester, or a combination thereof. 

28. The method according to claim 12 wherein the translocating polypeptide and 
the regulatory agent are units of a fusion protein. 

29. The method according to claim 12 wherein the regulatory agent is a single 
chain antibody (sFv). 

30. The method according to claim 12 wherein the regulatory agent is a 
polynucleotide encoding a single chain antibody. 

3 1 . The method according to claim 1 2 wherein the translocating polypeptide and 
the regulatory agent are covalently linked by a biotin-streptavidin complex or the E. 
Coli single stranded DNA binding protein. 

32. The method according to claim 12 wherein the cell line contains a single 
genomic recombination site and a plasmid containing the target gene and a 
recombination site that pairs with the genomic recombination site, and wherein the 
one or more regulatory agents includes a recombinase specific for the paired 
recombination sites, and wherein translocation of the recombinase causes 
recombination between the paired recombination sites resulting in stable integration 
of the target gene into the genome of the cell at the genomic recombinase site. 
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33. The method according to claim 32 wherein the recombinase is Flp and the 
recombinase sites are frt recombination sites. 

34. The method according to claim 32 wherein the recombinase is Cre and the 
recombinase sites are lox recombination sites. 

35. The method according to claim 12 wherein the one or more regulatory 
elements includes a transcription-blocking sequence flanked by recombinase 
recombination sites and the regulatory agent is a recombinase specific for the 
recombination sites, wherein translocation of the recombinase causes recombination 
of the recombination sites, thereby modulating expression of the target gene product. 

36. The method according to claim 35 wherein the recombinase recombination 
sites ziefrt sites and the recombinase is Flp or the recombinase recombination sites 
are lox sites and the recombinase is Cre. 

37. The method according to claim 12 wherein the one or more regulatory agents 
include a single chain antibody specific for a component of the one or more regulatory 
elements, wherein translocation of the single chain antibody in to the cell and binding 
of the antibody to the component modulates expression of the target gene product. 

38. The method according to claim 12 wherein the target gene is a reporter gene. 

39. The method according to claim 12 wherein the target gene is contained within 
a polynucleotide that further encodes a protein tag. 

40. The method according to claim 12 wherein the target gene encodes a toxic 
protein. 

41 . The method according to claim 39 wherein the protein tag is a myc epitope, a 
fluorescent peptide, or a poly His tag, or a combination of any two or more thereof. 
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42. The method according to claim 12 wherein the contacting comprises mixing 
the mammalian or insect cell with an additional cell transfected with a polynucleotide 
that encodes the regulatory agent and the translocating polypeptide and the additional 
cell expresses the nucleotide to obtain the regulatory agent attached to the 
translocating polypeptide. 

43. The method according to claim 42 wherein the additional cell is prokaryotic or 
eukaryotic. 

44. The method according to claim 12 wherein the contacting involves incubating 
the cell line with a soluble protein lysate prepared from an additional transfected cell 
that expresses one or more polynucleotides encoding the regulatory agent and the 
translocating polypeptide. 

45. The method according to claim 44 wherein the regulatory agent and the 
translocating polypeptide are expressed by the additional cell as a fusion protein. 

46. The method according to claim 12 wherein the cell is refractory to other 
transfection techniques. 

47. The method according to claim 12 wherein the cell is a member of a cell 
population and expression of the target gene is induced in substantially all of the cell 
population. 

48. A vector comprising a polynucleotide encoding a cell process-modifying 
molecule attached to a translocating polypeptide. 

49. The vector according to claim 48 wherein the vector is has a nucleotide 
sequence according to SEQ ID NO:l. 

50. The vector according to claim 48 wherein the vector is has a nucleotide 
sequence according to SEQ ED NO:2 
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GACGGATCGGGAGATCTCCCG ATCCC CTATGGTCGACTCTCAGTACAATCTG CTCTGATGCCGCATAGTT 
AAGCCAGTATCTGCTCCCTGCTTGTGTGTTGGAGGTCGCTGAGTAGTGCGCGA 

ACAAGG CAAGG CTTG AC CG ACAATTG CATGAAGAATCTG CTT AGGGTT AGGCG TTTTGCG CTGCTTCGCG 
ATGTACGGGCCAGATATA03CGTTGACATTGATTATTGACTAGTTATTAAT^ 

ATTAGTTCATAG CCCATAT ATGGAGTTCCG CGTT ACAT AACTT ACGG T AAATGG C CCGC CTGGCTGACCG 

CCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTrTC 

ATTGACGTCAATGGGTGGACTATTTACGGTAAACrGCCCACTTGGCAGTACATCAAGTGTA 

AAGTACGCCCCCTATTQACGTCAATGACGGTAAATQGCCCGCCTGGCATTATGCCCAQTACA TO 

TGGGACTTTCCTACTTGGCAGTAGATCTACGTATTAGTCATC^ 

AGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGA^ 

TGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATO 

CAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCT 

CTG CTT ACTGG CTTAT CGAAATT AAT A CG ACT CACTAT AGGG AGAC C CAAGCTGGCTAGTT AAG CTT ATT 

ATGACCTTCrrcGCCGCTCCGTGAAGTCGGGTCCGCGGGAGGTTCCGCGCGATGAGT^ 

ACACCCCGTCrTCAGGTATGGCGAGTCCCGATAGTCCGCCTGA^^ 

ACGCTCGCX3CCAGAGGGGCGAGGTCCGTTTCGTCCAGTACGACGAGTCGGATTATGCCCT 

TCGTCTTCCGAAGACGACGAACACCCGGAGGTCCCCCGGACGCCX3CX3TCC^ 

CCGGCCCGGGGCCTGCGCGGGCGCCTCCGCCACCCGCTGGGTCCGGAGGGGCCGGACGCACACCCACCAC 
CGCCCCCCGGGCCCCCCGAACCC^GCCXKHraCGT^ 

CGCXXjCAGGAAATCGWCCCAGCCAGAATCCGCCGCACTCCCAGACGCCCCCT 
GATCCAAGACACCCGCGCAGGGGCTGGCCAGAAAGCTGCACTTTAGCA 

GCCATGGACCCCCCGGGTGGCCGGCITTAACAAGOT 
ATGCATGCCCGGATGGCGGCTGTCCAGCTCTGGOACATGTCGCGTCOT 

AACTCCTTGGCATCACCACCATCCGOGTGACGGTCTCC^^ 
GTTGGTGAATCCAGACGTGGTCCAGGACGTOIACGCGGCCACGGCGACT 
CGCCCCACCGAGCGACCTCGAGCCCCAGCCCGCTCCGCCT^ 
AGCTCGGATCCACTAGTCGAGTGTGGTGGAATTCT^C^U3ATA 
AGAGGG CCCGCGGTTCGAAGAAAAACTCATCTCAGAAGAGGATCT 

GCCCCrrcCCCCX3TGCCrTCCTTGACCCrrGGAAGGTGCCAC^ 
AATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTC^ 

GAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGG CTCTATGG CTTCTGAGGCGGAAAGAA 

CCAGCTGGGGCTCTAGGGGGTATCCCCACGCGCCCTGTAGCGGCGCATT 

TACGCGCAGCGTGACCXjCTACACTTGCCAGCGCCCTAGCGCCCGCT^ 

CTC^CGACGTTCGCCGGCTTTCCCCXjrCAAGCrCTA 

CTTTACXX3CACCTCGACCCCAAAAAACTTGATTAGGGTGATO 

GACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGCT 

ACACTCAACCCTATCTCGGTCTATTCTTTTGATTT^ 

AAAATGAGCTGATTTAACAAAAATTTAACGCGAATTAATTCTC 

AAGTCCCCAGK3CTCCCCAGGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTA 

GGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATT 

TCCCGCCCCTAACTCraCCOVTCCCGCCCCTAACTCCGCCCMTTC 

ACTAATTTTTTTTATTTATGCAGAGGCCXIAGGCCX5CCT 

GGCTTTTTTGGAGGCCTAGGCrrTTTGCAAAAAGCT 

AAGAGACAGGATGAGGATCXnTTCGCATGATTaAACAAG 

GGTGGAGAGGCTATTTCGGCTATGACTXXK3CACAACAGACAATCGGCTC 

CTGTCAGCGCAGGGGCGCCCXKnTCTTTTrGTCAAGACOGACCT 

ACGAGGCAGCGCGGCTATCGTGK3CTGGCCAOGACGGGCGTTCCTTC 

TGAAGCGGGAAGGGACTGGCK3CTATTGGGCGAAGTGCCX3GGGCAGGATCT 

CCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGG 

CATT<X»ACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGAT^ 

GGATGATCTGGACGAAGAGCATCAOGGGCTCGCGCCAGCCGAACTC 

CCCGACCK3CGAGGATCTCGTCGTCACCCATGGCGATGCCTGCT 

GCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCT 

CCGTGATATTGCTGAAGAGCTTGGCXK3CGAATGGGCTC 

CCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCT^^ 

G AAATGAC (X^CCAAGOTACXSCCG^CCTGCCATCACGAGATTTCGATTCCACCG CCG CCTTCTATGAAA 
GGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGATCATC 

GTT CTT CG C C CACC C CAALTlXiTTT ATTG CAG CTTATAATGGTTACAAATAAAG CAAT AG CAT CACAAAT 
TTCACAAATAAAGCATTTTTTTCACTGCATTCT^ 

ATG TCTGTATACCGTCGACCTCTAGCTAGAGCTTGGCGTAATCATGGTCATAG CTGTTTCCTGTGTGAAA 

TTGTTATCCGCTCACAATTCCACACAACATACGAGCCGQAAQCATAAAQTC 

TGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACrro 

AGCTGCATTAATGAATCGGCCAACaCGaKXX^GAGGCGGTT^ 

GCrCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCT 

CGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCA 

ACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAA 
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GACGGATCGGGAGATCTCCCGATCCCCTATGGTCGACTCTCAQTACAATCTGCTCTGATQCCGCATAGTT 
iUVGCCAGTATCTGCTCCCTGCTTGTGTGTTGGAOT 

ACAAGGCAAGG CTTGACCG ACAATTGCATGAAGAATCTGCTTAGGGTTAGGCGTTTTGCG CTC 
ATGTACGGGCCAGATATACGCGTTGACATTGATTATTCACT 

ATTAGTTCATAG CCCATATATGGAGTTCCG CGTT ACATAACTTACGGT AAATGGCCCG C CTGG CTOAC CG 
CC CAACGACC CCCGCC CATTGACGTCAATAATGACGTATGTTCC CATAGTAACGCCAATAGGGACTTTCC 
ATTGACOTCAATGGGTGGACTATTT ACX3GT AAACTG C CCACTTGGCAG TACATCAAGTGTATCATATG CC 
AART ACG C CC C CTATTGACGTCAATG ACGGTAAATGG CC CG CCTGGCATT ATG C CCAGT ACAT GAC C TT A 
TGGGACTTTC CT ACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT^ 
AGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGAT^ 
TGKjGAGTTTGTTTTGGCACCAAAATCAACGGGACT 

CAAAT<KK3CGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCTGGCT 

CTG CTTACTGG CTT AT CGAAATTAAT ACGACTCACT AT AGGGAG ACC CAAGCTGG CTAGTTAAG CTT ATT 

ATGACCTCTCGCCGCTCCGTGAAGTCGGGTCCGCGGGAGGTTC 

ACACCCCGTCTTCAGGTATGGCGAGTCCCGATAGTCCGCCTGACACCTCCCGCCGTXX5 

ACGCTCGCGCCAGAGGGGCXjAGGTCCGTTTCGTCCAGTACGACGAGTCGGATTATO 

TCGTCTTCCGAAGACGACGAACACCCGGAGGTCCCCCGGACGCGK3CGTCCCGTCT 

CCGGCCCGGGGCCrrcCGCGGGCGCCTCCGCCACCCGCTGGGTCCGGAGG^ 

CGCCCCCOSGGCCCCCCMAACCCAGCGGGTGGCGACT^ 

CGCGGCAGGAAATCGGCCCAGCCAGAATCCGCCGCACTCCCAGACGCCCCCGCGTCGACGGCGCCAACC 

GATCCAAGACACCCGCGCAGGGGCTGGCCAGAAAGCTGCACTTTAGC^ 

GCCATGGACCCCCCGGGTGGCCGGCTTTAACAA3CX3CGTCIT^ 

ATGCATGCCCGGATGGCGGCGGTCCAGCTCTGGGAGAT^^ 

AACTCCTTGGCATCACCACCATCCGCGTGACGGTCTGCGAGGGCAAAAACC^ 

GTTGGTGAATC CAGACGTGGTG CAGGACGTCGACGCGGCCACGG CGACTCGAGGGCGTTCTGCGGCGTCG 

CGCCCCACCXJAGCGACCTCGAGCCCC^GCarc 

AGCTCGGATCCACTAGTCCAGTGTGGTGGAATTOCOT^ 

GCGGCCGCTCGAGTCTAGAGGGCCCGCGGTTCOAACAAAAACTCATCTC^ 

TACCGGTCATCATGACCATCACCATTGAGTTT^ 

CAOCCATCTGTTGTTTGCCCCTCCCCCGTGCCrm 

CCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCAT^ 

GCAGGACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGGC^ 

TCTGAGGCGGAAAGAACCAGCTGGGGCTCTAGGGGGTATCCCCACX3CGCCCT 

CGGCGGGTGTGGTGGTTACGCGCAGCXJTGACXXjCTACA 

TTTCTTCCCTTCCTTTCTCGC^ 

GGGTTCCGATTTAGTCCTTTACGGCACCTCQACCCCAAAA^ 
GGC«Ta3CCCTGATAGAOTGTTTTTOSCCCTT^ 
GTTCCAJUICTGGAACAACACTCAACCCTATCTCGGTCTACT 
TCGG CCTATTGGTTAAAAAATGAGCTGATTTAACAA 

TGAGTTAGGGTGTGGAAAGTCC CCAGGCTCCCCAGGCAGGCAGAAGTATG CAAAG CATGCATCTCAATTA 

GTCAGCAACCAGGTGTGGAAAGTC CCCAGG CTCC CGAGCAGGCAXJAAGTATGCAAAGCATGCATCTCAAT 

TAGTCAGCAACCATAGTCCCGCCCCTAACTCCXjCCCM 

CTCCGCCCCATGGCTCACTAATTTTTTTTATTTATC 

CCAGAAGTAXJIX3AGGAGGCTTTTTTGQAGGCCTAGGC^^ 

ATTTTCGGATCTGATCAAGAGACAGGATGAOGATCGTT^ 

GTTCTCCX3GCCX3CTTKX5GTGGAJBAGGCTA 

TGCOTCXXrrGTTCCGGCTOT^GOT 

CTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTCG 

TGCrCGACGTTGTCACTGAAGCGGGAAGGGACTGKSCTO 

GTCATCTCACCTTGCTCCTG CCGAGAAAGTATCGATCATGGCTGATGCAATGCGGCGGCTGC^ 

GATCCGGCTACCTGCCCATTCXjU^CCACCAAGCGAAAC^ 

CCGGTCTTGTCQATaVGQATGATCTGQACGAA^ 

GCTCAAGG CX3CG CATGCCCGACGGCX3AGK^TCTCGTCGTGACCCATGGCGATGCCTO 

ATGGTGGAAAATGGCCGCITTTCTGGATTCATCaACTC 

AC^TAGCGTTGGCTACCCGTGATATTGCrrGAAaAGCTTGGC^ 

TTACGGTATCGCCGCTCCCX3ATTCX3CAGCGCATCGCCTTCTA 

GGACTCTGGGGTTCGCXSAAATGACCGACCAAGCGACGCCCAACCrrc 

GCCGCCTTCTATGAAAGGTTGGGCTTCGGAATCGTTT^ CTGG ATG ATCCTCCAG CGCG 

GGGATCTCATGCTGGAGTTCTTCX3CCCACCCCAACTTGTTTATTX3 
CAATAGCATCAGAAATTTCACAAATAAAGCATTTT^ 
ATCAATGTATCTTATCATGTCTGTATACCGTCGACCTCTM 

AGCCTOGGOTX3CCTAATCAGTGAGCTAACTGACATTAATTO 

GGAAACCIX3TCGTCCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAG 

GCTCTTCCGCTTCCTCGCTCACTOACTCGCTC 

CTCAAAQOCGGTAATACGGTTATCCACAGAATCAQQGQATA ACXjCAG GAAAG^ 
CAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCX5TTGCTGGCGTTTTTCCA 
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AGCATCACAAAAATCGAOSCTCAACm^GAGGTGGCGAAArc 

^annAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAA 

AA^TG<S^^ 
CAGG^TCaWlXnX^CGCTCGTCa^ 

AGTTACATCATCCCCCATGlTGTGCAAAAAAGCGGTTAGCTCCriT^ 

^^^^^^^^^^^^^ 

S^^ACCCAACTGA^ 
TATTATTGAAGCATTTATCAGGGTTATTGTCTCA 

AACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC 
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